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IMPROVED S YSIEM FOR REGULATION OF TRANSGENE EXPRESSION 

TECHNICAL FIELD 

The present invention relates to an improved regulated-gene-expression system 
characterized by low basal expression and high specific inducibility, and to the use of 
5 regulated gene expression for control of immunoreactivity in gene therapy. Priority is 
claimed from US Provisional Application Serial No. 60/235,030, filed September 25, 
2000, US Provisional Application Serial No. 60/260,781, filed January 10, 2001, and US 
Provisional Application Serial No. 60/278,281, filed March 23, 2001, which are hereby 
incorporated by reference as if fully set forth herein. 

10 BACKGROUND OF THE INVENTION 

One goal of gene therapy is to deliver genes to somatic tissue in a manner that 
provides correction of inborn or acquired deficiencies and imbalances. Gene-based drug 
delivery offers a number of advantages over administration of recombinant proteins. 
These advantages include: conservation of native protein structure; improved biological 

15 activity; prolonged exposure to protein in the therapeutic range; prolonged availability of 
protein fi:om each administration; avoidance of systemic toxicities; and avoidance of 
infectious and toxic impurities. 

Efforts to develop gene-based delivery of therapeutic proteins such as, for 
example, erythropoietin ("EPO") for treatment of anemias of various etiologies have been 

20 underway for almost a decade. EPO is produced primarily in the kidney in adults and is 
responsible for stimulating the production of red blood cells from progenitor cells. In 
patients with renal insufficiency, compromised EPO production results in anemia. Low 
serum EPO levels may also be seen in anemic patients with cancer, as well as those with 
rheumatoid arthritis, HIV infection, ulcerative colitis, sickle ceU anemia, and in anemia of 

25 prematurity. However, certain proteins such as erythropoietin may have adverse effects if 
administration is not carefully controlled. For example, unregulated exposure to 
erythropoietin may result in life-threatening erythroid hyperplasia. In humans suffering 
from polycythemia, or a high red-blood cell count, prophylactic phlebotomy or blood 
removal is employed to maintain a hematocrit level below 45%. In animal models of 

30 erythrbpoietin-gene therapy, ururegulated expression systems routinely result in hematocrit 
levels in the range of 60 - 85%. (Savino, R., et al.. International Patent Publication No. 
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WO0009713, "Adenoviral Vectors Encoding Erythropoietin and Their Use in Gene 
Therapy"; Podsakoff, G., et al., U.S. Patent No. 5,846,528; Svensson, E., et al., Hum Gene 
Ther 8(15): 1797 (1997); Lemieux, P., et al.. Gene Therapy 7:986 (2000)). Periodic blood 
removal may be necessitated to avoid stroke and other severe polycythemic pathologies. 
5 (Zhou, S., et al.. Gene Therapy 5, 665 (1998)). 

In unregulated viral-vector-based erythropoietin-gene-delivery systems, the 
resulting hematocrit has been found to be viral dose dependent. Attempted adjustment of 
the hematocrit has been through empirical titration of the administered viral dose. 
(Kessler, et al., Proc. Natl. Acid. Sci. 93:14082 (1996)). Control of adverse and life 

10 threatening side effects through viral-dose titration, however, does not provide a 
satisfactory margin of therapeutic safety. What is needed for expression of proteins such 
as erythropoietin is the ability to closely regulate expression of the introduced gene across 
a range of administration dosages. 

Several regulated gene-expression systems for erythropoietin have been explored. 

15 For example, Rizzuto, G., et al., Proc. Natl. Acad. Sci. 96:6417 (1999), utilized a 
tetracycline-inducible promoter to drive expression of a mouse EPO gene from plasmid 
DNA administered in saline with electroporation. As a consequence of the high basal 
level of expression in this system, however, the amount of plasmid DNA that could be 
administered for controlled expression had to be empirically titrated to a sufficiently low 

20 delivery amount. 

Regulated viral vector systems for. EPO delivery have also been described. For 
example, Rendahl, K, et al.. Nature Biotechnology 16:757 (1998), reported a regulated 
two-viral vector system in which administration of tetracycline is designed to down 
regulate EPO production through the interaction of the two-vector gene products. 

25 However, control of EPO production was dependent on administered viral dose with 
gradual uncontrolled rise in hematocrit levels at higher viral doses. Ye, X., et al.. Science 
283:88 (1999) also reported the use of a two-viral vector system designed to be regulated 
by rapamydn. Although plasma EPO levels could be regulated by rapamycin, the 
hematocrit level could not, thereby indicating a basal level of EPO expression sufficient to 

30 stimulate a maximal increase in hematocrit. 

In addition, the use of viral vectors , is complicated by the generation of immune 
responses to the vector in immunocompetent hosts. As a consequence, viral vectors are 
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considered to have limited readministration potential. Manning, W. et al.. International 
Published Application WO9906562, "Method Enabling Readministration of AAV Vector 
Via Xnununosuppression of Host", described efforts to control this phenomena through the 
use of transient immunosuppression at the time of vector delivery. But 
5 immunosuppression, in general, may lead to undesirable side effects. 

The use of a viral vector for ex vivo transformation of fibroblasts to provide a 
mutated steroid-hormone-regulated system of erythropoietin gene expression has also been 
reported. (Serguera, C. et al., Human Gene Therapy 10:375 (1999)). However, induction 
of gene expression by mifepristone resulted in polycythemia that was not reversible upon 

10 cessation of mifepristone treatment. What is needed is an improved regulated system 
where increases in hematocrit are not obtained in the absence of specific induction. 

Recombinant interferon alpha "IFN-alpha" is the primary treatment for chronic 
hepatitis C virus infection. The current best treatment regimen (interferon with ribavirin) 
has a relatively low response rate that is attributed in part to the short half-life of interferon 

15 alpha in the circulation. . Emerging therapies are interferons with covalenfly attached 
polyethylene glycol moieties (peginterferon) that are shown to have a longer half-life, 
sustained absorption and a slower rate of clearance. Clinical trials have indicated that use 
of peginterferon given once weekly is more effective than using non-modified interfeon 
three times weekly. However, aU of the routinely injected IFN-alpha protein therapies are 

20 associated with substantial side effects that result in part ftom the high levels of interferon 
that are obtained by bolus injection. What is needed is a long term continuous and 
consistent expression of low circulating levels of INF-alpha such that a sufficient anti-viral 
level is obtained without toxic peak levels. A potential method of achieving this goal with 
a minimum number of treatments is gene therapy. An adenoviral delivery system for 

25 expression of interferon alpha firom the liver has been reported to provide protection of the 
liver from a hepatitis virus infection in a mouse model. Aurisicchio et al., J Virol 2000 
May; 74(10):4816-23. However, because of potential adverse effects with uncontrolled 
interferon expression, the ability to regulate expression of the interferon may be required. 
What is needed in this context is a tightly regulated gene expression system for interferon 

30 alpha whereby induction can be obtained through administration of a non-toxic small 
molecule inducer. 
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Furthennoie, the etiologies of many disease states are characterized by expression 
of a mutated protein or lack of protein expression due to a defect in one or more genes. 
Current treatment regimens include administration of human-derived protein or 
recombinant protein products to supplement the loss of endogenously produced protein. 

5 These proteins when adnainistered are often viewed by the host as foreign, leading to the 
generation of antibodies to the administered protein that renders the treatment regimen 
ineffective. One example of a class of diseases that are due to a genetic absence of 
functional protein is hemophilia. Hemophilia A and B are caused by functional 
deficiencies in Factor Vin and Factor DC respectively. Hemophilic patients have a high 

10 incidence of developing inhibitors to replacement factors and much effort is focused on 
how to avoid this complication. 

It has been demonstrated that expression of foreign proteins using a gene therapy 
approach can result in an inunune response against the foreign protein. This response can 
be cellular, humoral or both and can result in rapid loss of vector-^bearing cells (Fields et 

15 al., Mol Ther 1(3):225 (2000); Song et al. Hum Gene Ther 8(10):1207 (1997); Michou, et 
al.. Gene Ther 4(5):473 (1997); Dai et al., Ptt)c Natl Acad Sci USA 92(5):1401 (1995)). 
As an example of the effects of foreign transgene expression, when recombinant human 
EPO ("hEPO") transgenes are delivered to mice, immune responses to the foreign 
transgene product can neutralize elevations in hematocrit level, and antibody cross- 

20 reactivity to endogenous EPO can result in erythroid hypoplasia that may lead to fatal 
anemia. (Tripathy, et al. Nat Med. 2:545 (1996); (Kessler, et al., Proc. Natl. Acid. Sci. 
93:14082 (1996)). 

Hence, what is needed is an improved, regulated gene expression system having 
extremely low levels of basal expression while retaining high indudbility. What is further 
25 needed is a system for minimizing the potential for developing of an immune response to 
therapeutic gene products. 

SUMMARY OF THE INVENTION 
The present invention provides an improved molecular-switch, inducible- 
30 expression system for regulating the expression of a nucleic acid sequence in gene therapy 
under conditions in which tight control of expression is of particular hnportance. In one 
aspect of the invention, a system is provided wherein expression of the gene to be induced 
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is characterized by low or undetectable expression or biological effect in the absence of 
the inducer, but in the presence of the inducer, is characterized by efficient induction of 
expression or biological effect. In another aspect of the present invention, a method is 
provided that induces a measure of tolerance to transgenic proteins, thus making long-term 
5 administration of the protein by gene therapy or recombinant protein possible and 
effective. 

In one embodiment of the invention, the molecular-switch, inducible-expression 
system comprises two nucleic acid or expression cassettes. The first expression cassette 
includes a promoter driving the expression of a molecular switch protein. The molecular 

10 switch protein is a chimeric or fusion protein that includes a mutated DNA binding 
domain characterized by a modification that eliminates a domain having a potential for 
autodimerization in the absence of an inducer while retaining those domains required for 
recognition of its cognate DNA sequence on the promoter of the second expression 
cassette. In one embodiment the DNA binding domain is a truncated GAL-4 DNA 

15 binding domain. The fusion protein further comprises a transactivation domain, and a 
mutated ligand-binding domain of a steroid-hormone receptor capable of being activated 
by a non-natural ligand inducer such as mifepristone* In a one embodiment, the promoter 
is a tissue-specific promoter such as a-actin promoter specific for muscle tissues. The 
first expression cassette may also include 5' untranslated regions, synthetic introns, and 

20 poly (A) signals that increase the fidelity and level of expression of the molecular switch 
gene. The second expression cassette includes a transgene encoding a desired gene 
product controlled by an inducible promoter comprising GALr4 DNA-binding sites linked 
to a minimal promoter. The second expression cassette may also include 5* untranslated 
legions, syntiietic introns, and poly (A) signals that increase the fidelity and level of 

25 expression of the transgene to be induced. 

In another embodiment of the invention, the inducible expression system is applied 
in vivo to effect expression of a transgene for gene therapy purposes. In one embodiment 
the. inducible expression system is formulated with non-ionic or anionic polymers and 
injected into an animal or human. Enhancement of transfection in vivo may be obtained 

30 with in vivo electroporation. In addition to using a tightly regulated expression system, a 
method is provided to minimize the immune response of the animal to the transgene 
product or any other introduced nucleic acids and proteins. Preferably, after 
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administration of the expression system, the induction of the expression system occurs 
after the animal's initial inmiune reaction to the injection and electroporation has subsided. 
For example, a lag time between the administration of the expression system and the 
inducer may be at least 12 days, more preferably, at least 20 days, or most preferably, 
5 greater than 50 days. Furthermore, the method may also include the administration of the 
inducer using a pulsatile program that further enhances the immunotolerance of the 
animal. 

In another embodiment of the invention, the inducible expression system as 
introduced in an animal may be characterized by the ability to repetitively effect a 
10 biological response using repetitive administration of an inducer of the expression system. 
In a preferred embodiment, the biological response to the inducer is maintained over a 
period of at least one year using only a single administration of the expression system. 

Other and further objects, features and advantages will be apparent from the 
following description of the presently preferred embodiments of the invention which are 
15 given for the purposes of disclosure when taken in conjunction with the accompanying 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 depicts the regulation of murine EPO expression in mice for 1 year using 
a prior expression-regulated system. 
20 Figure 2 is a schematic representation of the nucleic acid structures of an improved 

expression-regulated system comprising a GENESWITCT® regulator plasmid and an 
inducible EPO plasmid according to one embodiment of the present invention. Figure 2 
also depicts the interaction between the gene product (i.e. regulator protein) of the 
GENESWITCH® plasmid and an inducer (e.g. anti-progestin) to induce the expression of 
23 EPO encoded by the inducible EPO plasmid. 

Figure 3 depicts the consensus intron structure SEQ. ID. NO. 6 and the sequence of 
a synthetic intron SEQ. ID. NO. 5 according to one embodimrat of the present invention. 

Figure 4 depicts the sequence of a particular synthetic intron, IVS8 SEQ. ID. NO. 
7, according to one embodiment of the present invention. 
30 Figure 5 depicts the amino acid sequence and structure of the GAL-4 DNA binding 

domain, amino acid residues 1-93. SEQ. ID. NO. 10. 
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Figure 6 depicts the comparison between the basal expression of hiutnan EPO in 
COS-1 cells using a CMV-controUed (constitutive) hHPO expression plasmid and various 
inducible EPO plasmids (pEP1556 (intact inducible promoter) and pEP1596 (initiator, 
"inr", deleted promoter). Figure 6 also depicts the comparison of basal and induced 
5 expression of human EPO in COS-1 cells using inducible hEPO plasnwds regulated by 
various GENESWrrCH® plasmids (pGS1210 (CMV-GSv3.1) or pGS1539 (CMV- 
GSv4.0) according to one aspect of the present invention. 

Figure 7 depicts the coding sequences of the GENESWITCH® plasmids v.3.1 
(SEQ. ID. NO. 12) and v.4.0 (SEQ, K). NO. 13), wherein SEQ. ID. NO. 13 has a truncated 
10 GAL-4 DNA-binding domain. 

Figure 8 depicts the amino acid sequences of the GENESWITCH® regulator 
proteins (SEQ. ID. NOS. 14 & IS) encoded by the coding sequences depicted in Figure 7. 
SEQ, NOS. 12 & 13 respectively. 

Figure 9A depicts a schematic representation of relevant regions of an inducible 
15 promoter with unique restriction sites indicated. Figure 9B depicts the nucleic acid 
sequence of the 6X GAL-4/rATA promoter region SEQ. ID. NO. 25, and Figure 9C 
depicts the nucleic acid sequence of the TATA box, initiation ("inr") region and the IJT12 
Transcription factor binding site, SEQ. ID.NO. 17. 

Figure 10 depicts nucleic acid sequences of the inducible promoter regions of 
20 plasmids tested in development of an improved inducible promoter. SEQ. ID. NOS. 18, 
19, 20, & 21. 

Figure 11 depicts the plasmid map of pGS1694, GENESWITCH® Plasmid 
. Version 4.0 having a chicken skeletal a-actin ("SK") promoter according to one 
embodiment of the present invention. 
25 Figure 12 depicts the plasmid map of pEP1666, a expression-regulated hEPO 

expression plasmid, according to one embodiment of the present invention. 

Figure 13 depicts the complete nucleic acid sequence of pGS1694, SEQ. ID. NO. 

22. 

Figure 14 depicts the complete nucleic acid sequence of pEP1666, SEQ. ID. NO. 

30 23. 

Figure 15 depicts hematocrit regulation in mice by low oral doses of mifepristone 
(MFP) using an improved expression-regulated system according to one embodiment of 
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the present invention. A 1:1 mixture of the improved plasmids, pEP1705 (inducible 
mEPO) and pGS1694 (muscle-specific GENESWITCH® v.4.0) in a total dose of 150 
micrograms, was formulated with sodium poly-L-glutamate and delivered with 
electroporation to both tibialis and gastrocnemius muscles of mice on day 0. Beginning on 

5 day 25, oral MFP (A) was administered chronically (daily on weekdays) at doses of 0.0 
(O), 0.01 mg/kg (•), 0.03 mg/kg (A) or 0.10 mg/Kg (■). 

Figure 16A depicts the hematocrit regulation in rats following delivery of low 
plasmid doses of CMV-controUed EPO expression plasmid. Figure 16B depicts the 
hematocrit regulation in rats following the delivery of pEP1642 (inducible rat EPO) and 

10 pGS1694 (muscle-specific GH^SWITCH® v.4.0) and induction by MFP. 

Figure 17 demonstrates the rapid onset of erythroid hypocythemia in mice with 
constitutive EPO expression using a CMV-hEPO plasmid. 75 mg of pEP1549 (CMV- 
hEPO) was formulated with sodium poly-L-glutamate and delivered with electroporation 
to both tibialis and gastrocnemius muscles of five mice. Figure 17A demonstrates levels 

15 of hEPO expression exhibited by 5/5 mice (mean ± SEM). Figure 17B demonstrates 
hematocrit levels exhibited by 5/5 mice (mean ± SEM). 

Figure 18 depicts the long-term regulation of a foreign transgene (human EPO) in 
mice using an improved expression-regulated system according to one embodiment of the 
present invention. Figure 18A depicts the level of hEPO in the serum of mice and Figure 

20 18B depicts the % hematocrit level. 

Figure 19 depicts the quantitative PGR results showing the duration of retention of 
the mouse EPO plasmid DNA following delivery by electroporation using saline and poly- 
L-glutamic acid formulations. 

Figure 20 depicts EPO expression and hematocrit levels in mice following deUvery 

25 of the mouse EPO gene by electroporation using saline and poly-L-glutamate 
formulations. 

Figure 21 depicts EPO expression in mice following delivery of the mouse EPO 
gene by electroporation using saline and poly-L-glutamate formulations over a three- 
month time frame. 

30 Figure 22 depicts the constitutive expression of hF.IX driven by a CMV enhancer 

promoter in CD-I mice. 
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Figure 23 depicts the regulated expression of hF.IX in CD-I mice using an 
expression-regulated system according to one embodiment of the present invention. The 
first administration of mifepristone (filled triangles) was 28 days after injection. 

Figure 24 depicts the regulated expression of hIFN-alpha2B in a mouse model. 

5 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
With long term expression of gene products, and particularly in circumstances in 
which physiological effects are unpredictable, or are associated with adverse effects or 
toxicities, delivery of certain genes or gene products may require the ability to closely 

10 control expression of transfected genes from outside the body. 

Ideally, control of transgene expression via a "molecular switch" should allow 
specificity, selectivity, precise timing and level of expression, safety, and rapid clearance 
of the triggering compound. According to one aspect of the present invention, a system 
for regulating gene expression is generally depicted in Figure 2 using the EPO gene as an 

15 example. The "niolecular switch", expression system is generally comprised of two 
nucleic acid or expression cassettes: (1) a molecular switch or otherwise called a 
GENESWirCH® plasmid, and (2) an mducible gene plasmid (e.g., Inducible EPO 
plasmid, inducible F.IX plasmid, inducible IFN plasmid). Although Figure 2 suggests that 
the two nucleic acid cassettes are carried on two different plasnoid vectors, the two nucleic 

20 acid cassettes may also be carried on two viral vectors or combined in a single plasmid 
vector or a single viral vector having both nucleic acid cassettes. 

The term "molecular switch plasmids," as used herein, refers to plasmids encoding 
chimeric transcriptional regulator or "molecular switch" molecules or pioteins having, but 
not limited to: 1) a sequence specific DNA binding domain (DBD) such as for example the 

25 GAL-4 DBD; 2) a mutated steroid receptor ligand binding domain such as for example a 
human progesterone receptor ligand binding domain having a C-tenninal deletion of about 
19-66 amino acids wherein the mutant may be activated in the presence of an antagonist 
for the naturally occurring or wild-type progesterone receptor, and 3) a transactivation 
domain, such as for example the herpes virus VP-16 or NFkappaB p65 transactivation 

30 domain. The transactivation domains may also be selected from a number of other 
transactivation domains known to those of skill in the art, such as for example, TAF-1, 
TAF-2, TAU-1, and TAU-2. 
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The term "GENESWITCH®" is a registered trademark of Valentis, Inc. and is 
used to identify "molecular switch plasmids," "molecule switch" proteins or molecules, 
and expression systems generated by Valentis, Inc. The prefix "pGS" is abbreviation 
identifying GENESWITCH® plasmids. 

5 Figure 2 generally depicts the interaction of the molecular-switch plasmid and 

proteins with the inducible gene plasmid. La general, the first expression cassette 
contained in the GENESWITCH® plasmid may comprise a promoter driving the 
expression of a fusion or chimeric protein. The promoter may be any promoter such as a 
CMV promoter or a tissue-specific promoter for expression in an animal cell. A preferred 

10 promoter for use with one aspect of the invention is a muscle-specific promoter with 
advantages as will be discussed below. The fusion or chimeric protein expressed from the 
fusion or chimeric gene generally comprises three structural domains represented by GAL- 
4, hPR LBD, and p65 in the GENESWITCH® plasmid in Figure 2. These three domains 
also correlate to the functional domains of the fusion protein. 

15 For example, GAL-4 represents nucleic acid sequence correlating to the GAL-4 

DNA-binding domain responsible for the interaction or binding of the fusion protein to the 
GAL-4 sites in the inducible promoter of a second nucleic add cassette, depicted as the 
inducible EPO plasmid. p65 represents nucleic acid sequence correlating to the 
transcription regulatory domain of the NFkappaB p65 protein. 

20 The hPR LBD correlates to the ligand-binding domain of the fusion protein, which 

is responsible for the interaction of the fusion protein with a ligand represented by an oral 
anti-progestin in Figure 2. In a specific example, the ligand-binding domain (LBD) is 
derived from the amino acid sequence correlating to the ligand-binding domain of human 
progesterone receptor QiPR), a receptor in the steroid-receptor family. As will be 

25 discussed in greater details below, the amino acid sequence in the LBD of hPR may be 
mutated to result in a mutated hPR LBD (or, more generally, a mutated steroid-receptor 
LBD) that selectively binds to the anti-progestin instead of progestin, the natural 
ligand/agonist of the progesterone receptor. Through a mutated hPR LBD, the fusion 
protein may, thus, be selectively activated by an anti progestin, instead of the naturally 

30 occurring progestin. Thus antiprogestins bind to the natural PR, but act as antagonists. 
Progestins bind to the natural PR and act as agonists. Progestins do not bind to the 
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truncated PR and thus have no activity. The mutated PR retains the ability to bind 
antiprogestins, but now they act as agonists. 

As shown generally in Figure 2, when the anti-progestin binds to the fusion protein 
expressed from the GENESWITCH® plasmid, the fusion protein is activated and forms a 

5 dimer complex. The dimer/anti-progestin complex, in turn, binds to the promoter of the 
inducible EPO plasmid and transactivates the transcription of the EPO gene. It should be 
noted that the specific nucleic acid structures depicted in the two nucleic acid cassettes in 
Figure 2 are provided as examples, and various modifications can be made to achieve a 
similarly tightly regulated expression system. 

10 For example, the transregulatory domain represented by NFkappaBp6S may be 

substituted with various other transregulatory domains such as, for example, VP-16, TAF- 
1 and TAF-2 (Pham et al. Mol Endocrinol 1992 Jul;6(7): 1043-50), TAU-1 CDahhnan- 
Wright et al. (1994) Proc. Natl. Acad. Sd. U.S.A. 91, 1619-1623), TAU-2 (Milhon et al. 
Mol Endocrinol 1997 Nov;ll(12):1795-805), ORF-10 (Moriuchi et al. Proc Natl Acad Sci 

15 USA 1995 Sep 26;92(20):9333-7 ); TEF-1 (Hwang et al. EMBO J 1993 Jun;12(6):2337- 
48) and any other nucleic acid/amino acid sequences having a transcription regulatory 
function. In one embodiment, the preferred transactivation domain is a human 
transactivation domain, such as for example NFkappaB p65, where relative humanization 
of the molecular switch protein is desired. 

20 The DNA-binding domain and the corresponding 6x GAL-4 binding site in the 

inducible gene plasmid should not be seen as being limited to the modified GALr4 DNA- 
binding domain described herein. Other DNA binding domains that have been altered to 
remove sequences that are not essential for recognition of binding sites but may be 
predicted to contribute to autodimerization by virtue of their secondary stracture may also 

25 be used Other DNA binding domains that may be so modified include for example the 
known DNA binding domains of the steroid-receptor family (e.g., glucocorticoid receptor, 
progesterone receptor, retinoic acid receptor, thyroid receptor, androgen receptor, 
ecdysone receptor) or other cellular DNA binding proteins such as the cAMP Response 
Element Binding protein (CREB) or zinc finger DNA binding proteins, such as SPL The 

30 GAL-4 DNA-binding domain is preferred in one embodiment because it readily allows for 
greater control and selectivity of gene activation using this expression system in 
mammalian cells. 
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The steroid-receptor family of gene regulatory proteins is also ideal for the 
construction of molecular switches. Steroid receptors are ligand activated transcription 
factors whose ligands can range from steroids to retinoids, fatty acids, vitamins, thyroid 
hormones, and other presently unidentified small molecules. These compounds bind to 
5 receptors and either up-regulate or down-regulate the expression of steroid-regulated 
genes. The compounds are usually cleared from the body by existing mechanisms and are 
usually non-toxic. The term "ligand," as used herein, refers to any compoimd or molecule 
that activates the steroid receptor, usually by interaction with (binding) the ligand-binding 
domain (LBD) of the steroid receptor. 

10 The term "steroid-honnpne leceptor" as used herein refers to steroid-hormone 

receptors in the superfamily of steroid receptors, some of which are known steroid 
receptors whose primary sequence suggests that they are related to each other. 
Representative examples of the steroid-hormone receptors include the estrogen, 
progesterone, glucocorticoid-ot, glucocorticoid-P, mineralocorticoid, androgen, thyroid 

15 hormone, retinoic acid, retinoid X, Vitamin D, COUP-TF, ecdysone, Nurr-1 and orphan 
receptors. The receptors for hormones in the steroid/thyroid/retinoid supergene family, for 
example, are transcription factors that bind to target sequences in the regulatory regions of 
hormone-sensitive genes to enhance or suppress their transcription. These receptors have 
evolutionarily conserved similarities in a series of discrete structural domains, including a 

20 ligand binding domain (LBD), a DNA binding domain (DBD), a dimerization domain, and 
one or more trans-activation domain(s). 

Various mutations or changes in the anuno acid sequences of the different 
structural domains may be generated to form a mutated steroid receptor or, more 
specifically, mutated steroid-hormone receptor. The term "mutated steroid receptor," 

25 "modified steroid receptor," or a "mutated steroid-hormone receptor" or "modified steroid- 
hormone receptor," as used herein is a steroid receptor or steroid-hormone receptor that 
has been mutated in its amino acid sequences such that the mutated form is capable of 
preferentially binding to a non-natural or non-native ligand rather than binding to the wild 
type, or naturally occurring, hormone receptor ligand. Usually this mutation is generated 

30 in the ligand-binding domain of the steroid receptor and may be denoted as "mutated 
steroid-receptor LBD." A mutated steroid receptor has the property to activate 



f 

wo 02/24899 PCT/USOl/30305 

13 

transcription of a desired gene (such as a gene encoding erythropoeitin) in the presence of 
an antagonist for a wild type steroid hormone receptor protein. 

Normally, a non-natural or non-native ligand may act as an antagonist or may have 
an antagonist effects to a wild-type steroid receptor or steroid-hormone receptor. 

5 "Antagonist" as used herein is a compound that interacts with or binds to a native steroid 
hormone receptor and blocks the activity of the agonist of the native steroid hormone 
receptor. "Agonist" as used herein is a compound that interacts with the wild type steroid 
hormone receptor to promote a transcriptional response. 

For example, progesterone or progestin is an agonist for the progesterone receptor 

10 because progesterone normally binds to the progesterone receptor to activate the 
transcription of progesterone-regulated genes. Compounds, which mimic progesterone, 
would also be defined as progesterone receptor agonists. Mifepristone (MFP) or otherwise 
known as RU486 is a non-natural ligand that also binds to the progesterone receptor and 
competes with progesterone for binding. Although under certain special circumstances, 

15 MFP may slightly activate certain progesterone-regulated genes through the progesterone 
receptor, the amount of activation is minimal when compared to the major activity of 
MFP, which is to block the activation of the progesterone receptor by progesterone. 
Hence, in the presence of progesterone and the progesterone receptor, MFP exerts an 
antagonistic effect on the progesterone receptor because it blocks the normal activation of 

20 the receptor by progesterone. 

The progesterone receptor may be mutated, e.g. in the ligand-binding domain of 
the progesterone receptor, such that it only binds to MFP and not to progesterone. The 
mutation of the ligand-binding domain of progesterone receptor may be such that binding 
of the MFP may actually activate the progesterone receptor under typical cellular 

25 conditions. When a mutated PR LBD, or more generally any other mutated steroid- 
receptor LED, is combined as a fusion protein with a particular DNA-binding domain 
such as the GAL-4 DNA binding domain, binding of MFP selectively activates the fusion 
protein to transactivate gene expression driven by a promoter recognized by the DNA- 
binding domain. Thus, the mutated steroid receptor of the subject invention is not 

30 activated in the presence of agonists for the native receptor, but instead the mutated steroid 
receptors may be activated in the presence of "non-natural ligands." 
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The term "non-natural ligands" or "non-native ligands" refers to compounds which 
are normally not found in animals or humans and which bind to the ligand binding domain 
of a receptor. Examples of non-natural ligands and non-native ligands are anti-hormones 
that may include the following: lip-(4-dimethylaminophenyl)-17p-hydroxy-17a- 

5 propynyl-4, 9-estradiene-3-one (RU38486 or Mifepristone); lip-(4- 
dimethyIaininophenyl)-17a-hydroxy-17P-(3-hydroxypropyl)-13 a methyl-4, 9-gonadiene- 
3-one (ZK98299 or Onapristone); lip-(4-acetylphenyl)-17P-hydroxy-17a-(l.piopynyl> 
4,9-estradiene-3-Dne (ZK112993); liP-(4-dimethylaminophenyl)-17p-hydroxy-17a-(3- 
hydioxy-1 (Z)-propenyl-estra-4,9-diene-3-one (ZK98734); (7P, 1 ip, 17P)-1 1-(4- 

10 dimethylaminophenyl>7-methyl-4*,5'-dihydrospiio [ester-4,9-diene"17,2' (3H)-ftiran]-3- 
one (Org31806); (lip ,14p, 17a)-4', 5*-dihydro-ll-(4-dimethylammophenyl)-[spiroestra- 
4, 9-diene-17,2X3'H)-furan]-3-one (Org31376); 5-a-pregnane-3,2-dione, Org 33628 
(Kloosterboer et al. Ann N Y Acad Sci 1995 Jun 12;761:192-201), Org 33245 (Schoonen 
et al. J Steroid Biochem Mol Biol 1998 Feb;64(3-4): 157-70).. 

15 "Mutant," "mutation," "mutated," "modified," or "modification" refers to an 

alteration of the primary sequence of a receptor or any other gene or protein such that it 
differs from the wild type or naturally occurring sequence. For example, a mutant, mutated 
or modified steroid-hormone receptor protein as used in this disclosure can be a mutant of 
any member of the steroid-hormone receptor superfamily. For example, a steroid receptor 

20 can be mutated by addition of amino acid(s), substitution of amino acid(s) or deletion of 
amino acid(s). Preferably, the deletion of the amino acids occurs on the carboxy terminal 
end of the protein. Generally, a deletion of from about 1 to about 120 amino acids from 
the carboxy terminal end of the protein provides a mutant useful in the present invention. 
A person having ordinary skill in this art will recognize, however, that a shorter deletion of 

25 carboxy terminal amino acids will be necessary to create useful mutants of certain steroid 
hormone receptor proteins. For example, a mutant of the progesterone receptor protein 
will contain a carboxy terminal amino add deletion of from about 1 to about 60 amino 
acids. In another embodiment, 19 carboxy terminal amino adds are deleted from the 
progesterone receptor protein. 

30 Furthermore, a mutated steroid-hormone receptor LED may be selected based on 

the ability of an antagonist for the wild-type steroid-hormone receptor to activate the 
mutant receptor even in the presence of an agonist for the wild-type receptor. Thus, in the 
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case of the progesterone receptor, progesterone is the normal ligand and functions as a 
strong agonist for the receptor. The anti-progestin, mifepristone (RU486), is a non-natural 
or non-native ligand for the progesterone receptor. Mifepristone (MFP) is considered an 
"anti-progestin" because, although it is able to exert a slight agonist effect on the wild-type 

5 progesterone receptor, MFP inhibits the strong agonistic effects of progesterone. Thus, 
MFP may be considered an "antagonist" for the wild-type progesterone receptor when in 
the presence of the normal agonist, i.e. when both MFP and progesterone are together in 
the presence of the wild-type progesterone receptor. In contrast, in one example of a 
mutated steroid-hormone receptor according to the invention, the mutated progesterone 

10 receptor is not activated by progesterone (agonist for the wild type receptor) but is 
activated in the presence of MFP ("antagonist" for the wild type receptor). In addition, 
progesterone is not able to block the activation of the mutated steroid-hormone receptor by 
MFP. Thus, the mutated receptor may be characterized as activated when bound to an 
antagonist (MFP) for the wild-type receptor even in the presence of an agonist 

15 (progesterone) for the wild-type progesterone receptor. 

Further examples of mutated and modified steroid-hormone receptor for used with 
the current invention are described in, for example: (1) "Adenoviral Vector-Mediated 
Delivery of Modified Steroid Hormone Receptors and Related Products and Methods" 
International Patent PubUcation No. WO0031286 (PCT/US99/26802); (2) "Modified 

20 Glucocorticoid Receptors, Glucocorticoid Receptor/ Progesterone Receptor Hybrids" 
International Patent Publication No. W09818925 (PCT/US97/19607); (3) "Modified 
Steroid Hormones for Gene Therapy and Methods for Their Use" International Patent 
PubUcation No. WO9640911 (PCT/US96/0432); (4) "Mutated Steroid Hormone 
Receptors, Methods for Their Use and Molecular Switch for Gene Therapy" International 

25 Patent Publication No. WO 9323431 (PCTAJS93/0439); (5) "Progesterone Receptors 
Having C-Teiminal Hormone Binding Domain Truncations", U.S. Patent No. 5,364,791; 
(6) "Modified Steroid Hormone Receptors, Methods for Their Use and Molecular Switch 
for Gene Therapy" U.S. Patent No. 5,874,534; and (7) "Modified Steroid Hormone 
Receptors, Methods for Their Use and Molecular Switch for Gene Therapy" U.S. Patent 

30 No. 5,935,934, all of which are incorporated herein by reference in their entirety, 
including any drawings. 
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Although the examples provided in this disclosuie use plasmid-based mutated 
steroid hormone, receptor system, viral-based versions of a mutated steroid hormone 
receptor system according to the present invention may also be used to regulate gene 
expression in vitro or in vivo. There are several specific examples of the use of early 
5 versions of inducible transcriptional regulator system in viral vectors. For example: (1) 
the positive and negative regulation of gene expression in eukaryotic cells with an 
inducible transcriptional regulator is described in Wang, et al., Gene Therapy, 4: 432- 
441,1997; and (2) drug inducible transgene expression in brain using a herpes simplex 
virus vector is described in Oligino, et al.. Gene Therapy, 5: 491-496,1998. These above 

10 examples may be applied to provide for viral-based regulated gene-expression systems 
according to the present invention. 

To improve the tightness of regulated expression, several aspects of the 
GENESWrrCH® expression cassette and the inducible therapeutic molecule expression 
cassette were modified. First, the transcribed portion of the GENESWITCH® expression 

15 cassette was modified to include post-transcriptional elements (5' UTR, synthetic intron 
and poly(A) signal) that improve the level and fidelity of transgene expression. Second, 
the structure of the GENESWITCH® regulatory protein was modified. The regulator 
protein appears to have a propensity, in the absence of ligand, to form dim^ that could 
bind GAL-4 sites in the inducible promoter and thereby partially activate transcription. To 

20 minimize this possibility, truncation or mutation on the GAL-4 domain of the regulator 
protein may be made such as deleting from the C-terminal portion of the GAL-4 DBD, 
about 20 residues, thereby reducing the length of a coiled-coil structure that was predicted 
to contribute to GAL-4 homodimer formation. GENESWITCH® regulator protein v.4.0 
embodies this modification. Thkd, the CMV promoter of the GENESWITCH® 

25 expression cassette may be replaced with a tissue-specific promoter such as avian skeletal 
alpha-actin promoter, which is muscle-specific. Under certain circumstanqes the muscle 
specific may be more or less active than the CMV promoter in muscle tissue. 

The inducible therapeutic molecule expression plasmid may be also that modified in 
the core region of the inducible promoter. It has been found that delivery of 75 

30 micrograms of the original inducible mEPO plasmid (pEP1442) by itself (in the absence of 
the GENESWITCH® plasmid) caused hematocrit level in mice to increase to 50% or 
greater. It was then determined that a deletion in the transcription initiation region of the 
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inducible EPO plasmid can reduce the intrinsic activity of the promoter by approximately 
10-fold without impairing its ability to be induced. When tested in combination with an 
improved GBNESWITCH v.4.0 plasmid in transfected COS-1 cells, the level of basal 
expression was reduced 8-fold and the level of induced expression remained essentially 
5 unchanged. It should be noted that the modifications described above may be employed 
independently or in combination with each other depending on the desired effect. 

In one embodiment of the present invention, a molecular switch protein comprising 
a chimeric receptor having a mutated progesterone-receptor ligand-binding domain, a 
truncated GAL-4 DNA binding domain, and a VP16 or p65 transregulatory domain is 

10 disclosed. The p65 transregulatory domain is part of the activation domain of the human 
p65 protein; a component of the NFkappaB complex. By replacing VP16 with a variety of 
human-derived activation domains such as, for example, residues 286-550 of the human 
p65, the potent inducibility of the chimeric receptor can be retained while "humanizing" 
the protein or reducing the potential for a foreign protein immune response due to the viral 

15 VP16 component. In the presence of the anti-progestin MFP (RU486), this chimeric 
regulator binds to a target nucleic acid sequence containing a 17-mer GAL-4 binding site, 
and results in an efficient ligand-inducible transactivation of the target gene downstream 
of the GALr4 binding site. The modified steroid-hormone ligand-binding domain of the 
receptor protein may also be modified by deletion of caiboxy terminal amino acids, 

20 preferably, from about one to one hundred-twenty carboxy terminal amino acids. The 
extent of deletion desired can be modulated according to conventional molecular 
biological techniques to achieve both selectivity for the desired ligand and high 
inducibility when the ligand is administered. In one embodiment, the mutated steroid 
hormone receptor LBD is mutated by deletion of about one to about sixty carboxy 

25 terminal amino acids. In another embodiment forty-two carboxy terminal amino acids are 
deleted. In yet another embodiment, having both high selectively and high inducibility, 
nineteen caiboxy terminal amino acids are deleted. 

As used herein, the GAL-4 DNA Binding Domain ("GAL-4 DBD") refers to 
amino acids 1 - 93 of the N-terminal DNA binding domain of GAL-4 as shown in Fig. 5 

30 SEQ. ID. NO, 10. As used herein, a "modified GAL-4 DBD" or mutated GAL-4 DBD 
refers to a GAL-4 DBD that has a mutation in the primary amino acid stmcture, or to a 
amino acid sequence derived from the GAL-4 DBD, that retains the ability to bind to the 



wo 02/24899 



PCT/USOl/30305 



18 

canonical 17-mer binding site, CGGAAGACTCTCCTCCG, (SEQ.ID.NO. 9), but has a 
reduced ability to form a helical tertiary structure needed for autodimerization. In one 
example, a deletion of a region represented by amino acids 75 to 93 of the native GAL-4 
DBD as depicted in Figure 5, provides for a modified or mutated GAL-4 DBD that when 
5 combined with the GENESWUCH® regulator protein decreases the basal expression of 
EPO from an inducible expression plasmid. Other mutations or deletions may also be 
made to the region spanning amino acid sequence 54-74 of the GAL-4 DBD as shown in 
Figure 5. For example, a deletion of amino 54-64, or 65-75 may be made such that 
autodimerization through the coiled coil region is minimized. 

10 In one example, an optimized transgene regulation system is disclosed below 

which meets desired criteria for a robust system. In particular, an improved regulated 
muscle-specific EPO/GENESWITCH® system disclosed in one embodiment herein 
provides an undetectable biological effect from a phannacologicai dose of introduced 
erythropoietin transgene in the absence of inducer. By '^biological effect" it is meant that, 

15 although it may be possible to detect the production of messenger RNA by ultra sensitive 
assays such by Pplymerase Chain Reaction ("PC3l")> no physiologic effect, such as for 
example in the cases of EPO, no rise in hematocrit, is observed In one experiment using 
the most sensitive measurement of basal EPO expression, i.e., an increase in hematocrit 
level, the increase did not occur in mice with the improved mEPO/GENESWITCH® 

20 system, in rats with the rEPO/GENESWITCH® system, or in mice with 
hEPO/GENESWrrCH® system, even at plasmid doses (7.5 mg/Kg body weight) that 
greatly exceed those projected to be used in humans. In addition, serum levels of hEPO 
were undetectable in the absence of inducing drug when plasmids were delivered to dogs 
at 3.5 mg/Kg. 

25 On the other hand, the improved system responded to doses of MFP as low as 0.01 

mg^g and stable increases in hematocrit were maintained by a chronic dosing schedule. 
Responsiveness to low doses of MFP is highly favorable; especially since chronically 
administered 25 mg doses (0.25-0.5 mg/Kg) are well tolerated in humans. 

The kinetics of induction in response to oral MFP dosing was also rapid. In mice, 

30 peak expression occurs within 24 hours, and in dogs, peak expression occurs within 48 
hours. In naice, baseline expression levels were restored after 72-96 hours after cessation 
of MFP treatment. In dogs, near baseline levels of expression were observed 3 days after 
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cessation of MFP treatment. Responses following a single administration of plasmid were 
also durable and consistent. The consistency in the kinetics and peak levels for each of the 
four cycles of hEPO induction over an 8-month period in mice is a 'particularly good 
example. 

5 The improved system may also be scalable to larger animals. In dogs, for example, 

inducible EPO expression occurred at plasmid doses of 0.5-3.5 mg/Kg. In rats, plasmid 
doses as low as 0.03 mg/Kg were also sufficient. 

No evidence for immunogenicity or toxicity due to expression of the chimeric 
GENESWITCH® regulator protein was found. The ability of the improved plasmid-based 

10 system to avoid neutralizing inunune responses and thereby achieve long-term (8-month) 
regulated expression of a foreign transgene (human EPO) in a majority of mice is a 
surprising finding. Tig^t expression control may permit a pulsatile program of transgene 
expression that may deprive the immune system of a source of antigen at certain critical 
times. The control of neutralizing immune responses to foreign transgene products is 

15 directly relevant to the treatment of genetic deficiencies, like hemophilia A or B in which 
the normal protein can appear to be a foreign antigen. The above benefits of the presently 
disclosed improved regulated EPO/GENESWITCH® system represent an important 
advance in gene therapy. 

The improved system may also have several optional components that permit 

20 advantages over existing systems. First, the present improved GENESWITCH® regulator 
protein is mostly humanized (86%) (amino acid sequences derived fi-om human proteins 
except for the GAL-4 DNA-binding domain), thus reducing the potential for long-term 
immune responsiveness to the system in immune-competent recipients. 

Second, exogenous control of expression in the present system having undetectable 

25 baseline expression may also permit a lag time prior to first induction by MFP or other 
inducer. The length of the lag period between plasmid delivery and the first induction of 
transgene expression permits reduction in the potential for developing immune responses 
to the expressed transgene. Delivery of plasmids with electroporation, for example, is 
associated with transient inflammation and cellular infiltration that are able to activate 

30 dendritic cell maturation. Foreign transgene expression that is induced after inflanomation 
at the muscle site has subsided avoids transgene expression in a hyperinflammatory 
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environment Third, use of a muscle-specific promoter may also provide low level 
GENESWITCH® protein production and may minimize expression in non-muscle cells. 

An IFN gene therapy is an attractive potential therapy for a number of indications 
including cancer and chronic viral infections. Sequences for a nxmiber of different 
5 interferons, including interferon beta, gamma, omega and over 20 interferon alpha species, 
some of which may be pseudogenes, can be found in GenBank (for example, the reference 
sequence of interferon alpha 2 NM__000605). Variants of human leukocyte interferon 
alpha 2 (IFN-alpha 2a, alpha 2b, and alpha 2c) differ from each other by changes in their 
coding regions at nucleotide positions 137 and 170. IFNalpha2b has been determined to 

10 be a predominant species among the alpha 2 interferons, however the present invention is 
applicable to and encompasses regulated expression of any one of the naturally occurring, 
modified or synthetic interferons. 

For chronic hepatitis C infection, current therapies with recombinant IFN-alpha 
protein and PEGylated forms of IFN-alpha are partially effective and require combination 

15 treatment with ribavirin. Advantages of PEGylated IFN-alpha ace reduction in the 
frequency of injection from 3X to IX weekly, and unprovement in therapeutic response. 
However, all of the routinely injected IFN-alpha protein therapies are associated with 
substantial side effects. 

The elimination half-life of recombinant and pegylated INF-alpha proteins in 

20 humans is 4.8-7.2 h and 27-39 h, respectively. From published pharmacokinetic data on 
PEG-Intron A in humans (Glue et al, 2000, Hepatol. 32:647; Glue et al, 2000, Clin Pharm 
Ther 68:556), it is calculated using a conversion factor (1 unit = 3.714 picogram; 1 
picogram = 0.269 units), that continuous expression of -100 pg/ml should provide the 
same area under the curve of serum interferon levels as weekly treatments with PEGylated 

25 interferon alpha. This value probably deifines the upper limit, as continuous infusion of 
non-pegylated INF-alpha protein in hepatitis C patients yielded therapeutic benefit at 
circulating levels of -15 pg/ml (Schenker et al., 1997, J. Int. Cyt. Res. 17:665; Smith et al., 
1987, Cane. Chemother. Pharmacol. 20:327). Moreover, the side effects resulting from 
continuous infusion were lesser in intensity than those resulting from intermittent dosing, 

30 particularly when circulating levels were less than 200 pg/ml. 

Plasmid encoding human interferon alpha QiBPN-alpha) was injected into the 
tibialis of mice followed by administration of electrical pulses (electroporation) to 
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facilitate plasmid uptake. Using a plasmid driven by the CMV enhancer/promoter and 
injected plasmid doses of 1.0 micrograms xo 100 micrograms, a dose response of hlFN- 
alpha expression was achieved with peak levels of 16,458 ± 897 pg/ml with 
electroporation. Expression without electroporation was approximately 70-fold lower 
5 (238 ± 100 pg/ml). The ability to tightly regulate expression of an interferon gene therapy 
is expected to be a key component of the safety profile. Accordingly, hlFN-alpha was 
expressed in mice using the GENESWITCH® regulatory system. A two plasmid mixture 
was injected into the tibialis of mice. One plasmid encoded the GENESWITCH® protein 
under control of the skeletal actin promoter and the other plasmid encoded for hlFN-alpha 

10 containing the GAL4 binding sites linked to a consensus TATA box. Using this system 
hlFN-alpha levels peaked -24 hours after administration of mifepristone, the low 
molecular weight inducer. In mice, a pulsatile pattern of hlFN-alpha expression was 
achieved in CD-I mice for more than 30 days. Mean peak levels of hlFN-alpha protein 
exceeded 200 pg/ml. In C57BL/6 mice, mean peak levels of hlFN-alpha protein exceeded 

15 400 pg/ml and pulsatile regulation was achieved for -100 days. 

The term "expression cassette" or "nucleic acid cassette," as used herein refers to 
the combination of nucleic acid sequences involved in expression of a particular functional 
product. This functional product is typically a protein although it could also be a nucleic 
acid such as for example, an RNA molecule such as a ribozyme or antisense RNA. The 

20 expression cassette may also be comprised of a number of non-coding elements in addition 
to sequences encoding a product such as a protein. Non-coding elements are nucleic acid 
sequences bounded or defined by consensus sequences or having a contextual location 
identifiable or recognized by those of skill in the art. A "5' untranslated region" or "5' 
UTR" refers to a sequence located 3* to promoter region and 5' of the coding region. For 

25 example, the 5' end of the 5' UTR is typically defined as the transcription start site. 
Although the start of transcription may not be precisely known, it is often estimated to be 
approximately 30 base pairs 3' of the end of the TATA box. The 3' end of the 5TJTR 
would be defined as the base immediately 5' to the start codon (ATG). Thus, such a 
sequence, while transcribed, is upstream of the translation initiation codon and therefore is 

30 not translated into a portion of the polypeptide product Such a 5* UTR may also have an 
intron within it. In one embodiment of the present invention, the expression cassette' 
includes: promoter sequences, transcription start sequences, 5' untranslated ("5' UTR") 
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sequences, coding sequences from a start codon through a stop codon, and 3* untranslated 
sequences ("3' UTR") including polyadenylation sequences. As used herein the 5'UTR 
may include one or more functional non-coding elements able to increase the level and 
fidelity of expression. As used herein, the 5' UTR may include intron sequences that are 
5 transcribed and present in pre-messenger RNA ("pre-mRNA'), but are removed by 
splicing and are absent in the mature messenger RNA ("mRNA"). 

The term "intron" as used herein refers to a sequence encoded in a DNA sequence 
that is transcribed into an RNA molecule by RNA polymerase but is removed by splicing 
to form the mature messenger RNA. A "synthetic intron" refers to a sequence that is not 

10 initially replicated from a naturally occurring intron sequence and generally will not have 
a naturally occurring sequence, but will be removed from an RNA transcript during 
normal post-transcriptional processing. Such synthetic introns can be designed to have a 
variety of different characteristics, in particular such introns can be designed to have a 
desiied strength of splice site and a desired length. In a preferred embodiment of the 

IS present invention, both the molecular switch expression cassette and the therapeutic gene 
expression cassette include a synthetic intron. The synthetic intron includes consensus 
sequences for the 5' splice site, 3' splice site, and branch point. When incorporated into 
eukaryotic vectors designed to express therapeutic genes, the synthetic intron will direct 
the splicing of RNA transcripts in a highly efficient and accurate manner, thereby 

20 minimizing cryptic splicing and maximizing production of the desired gene product 

A "therapeutic molecule" or "therapeutic gene" is one that has a pharmacologic 
activity when administered appropriately to a manmial suffering from a disease or 
condition. Such a pharmacological property is one that is expected to be related to a 
beneficial effect on the course or a symptom of the disease or condition. The term 

25 "therapeutic protein" as used ha:ein refers to the native, full-length secreted form of a 
therapeutic protein, as well as to analogs or derivatives thereof comprising single or 
multiple amino acid substitutions, deletions or additions that retain native therapeutic 
protein function or activity. Sequences encoding therapeutic proteins may include codon 
optimized versions of native sequences. Optimal codon usage in humans is indicated by 

30 codon usage frequencies for highly expressed human genes and may be determined from 
the program "Human High.codN" from the Wisconsin Sequence Analysis Package, 
Version 8.1, Genetics Computer Group, Madison, WI. The codons that are most 
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frequently used in highly expressed human genes are presumptively the optimal codons 
for expression in human host cells, and thus form the basis for constructing a synthetic 
coding sequence. 

The term "plasmid" as used herein refers to a construction comprised of 
5 extrachromosomal genetic material, usually of a circular duplex of DNA that can replicate 
independently of chromosomal DNA. Plasmids may be used in gene transfer as vectors. 

The term "vector" as used herein refers to a construction comprised of genetic 
material designed to direct transformation of a targeted cell. A vector may contain 
multiple genetic elements positionally and sequentially oriented with other necessary 

10 elements such that an included nucleic acid cassette can be transcribed and when 
necessary translated in the transfected cells. As used herein the term "expression vector" 
refers to a DNA plasmid that contains all of the information necessary to produce a 
recombinant protein in a heterologous cell. 

"Erythropoietin", or "EPO," as used herein, refers to a glycoprotein hormone 

15 produced in fetal liver and adult kidney that acts to stimulate the formation of red blood 
cells from progenitor cells in the bone marrow and other hematopoietic tissue. Genes 
encoding human and other manmialian EPOs show a high degree of sequence homology 
in the coding region. (Wen et al. Blood 82:1507(1993)). The sequence of die gene 
encoding native human EPO, including methods of obtaining such sequences, are 

20 described in, for example among others: U.S. Patent Nos. 4,954,437 and 4,703,008; 
Shoemaker, U.S. Patent No. 4,835,260; Sytkowski and Grodberg, U.S. Patent Nos. 
6,153,407 and 5.614,184); Sytkowski, U.S. Patent No. 5,580.853; Mellovitz, U.S. Patent 
No. 5,888,772 and 5,457,089; Fibi and PoweU, U.S. Patent No. 5,688.679; Jacobs et al., 
Nature 313:806-810 (1985); Un et al. Proc. Natl. Acad. Sci. USA 82:7580 (1985); 

25 International Publication Number WO 85/02610; and European Patent Publication 
Number 232,034 Bl, all of which are hereby incorporated herein by reference in their 
entirety. 

The sequences of the genes encoding other mammalian EPOs such as for example 
feline (GenBank Acc. No.: L10606), canine (GenBank Acc. No.: L13027); porcine EPO 
30 (GenBank Acc. No.: L10607), and monkey (Macaca mulatta) (GenBank Acc. No.: 
L10609) are also known. 
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The tenn "phannacological dose" as used herein with a vector/molecular switch 
complex refers to a dose of vector and level of gene expression resulting from the action 
of the promoter on the nucleic acid cassette when introduced into the appropriate cell type 
which will produce sufficient protein, polypeptide, or antisense RNA to either (1) increase 
5 the level of protein production, (2) decrease or stop the production of a protein, (3) inhibit 
the action of a protein, (4) inhibit proliferation or accumulation of specific cell types, or 
(5) induce proliferation or accumulation of specific cell types. The dose will depend on 
the protein being expressed, the promoter, uptake and action of the protein or RNA. The 
term "pharaiacological dose" as used herein with a ligand refers to a dose of ligand 

10 sufficient to cause either up-regulation or down-regulation of the nucleic acid cassette. 
Thus, there will be a sufficient level of ligand such that it will bind with the receptor in the 
appropriate cells in order to regulate expression fi-om the nucleic acid cassette. The 
specific dose of any ligand will depend on the characteristics of the ligand entering the 
cell, binding to the receptor and then binding to the DNA and the amount of protein being 

15 expressed and the amount of up-regulation or down-regulation needed. 

In a plasmid based expression system, a non-viral gene medicine may also be 
composed of a synthetic gene delivery system in addition to the nucleic acid encoding a 
gene product (e.g., a therapeutic protein). The non-viral gene medicine products are 
generally intended to have low toxicity due to the use of synthetic components for gene 

20 delivery (minimizing for instance the risks of immunogenicity generally associated with 
viral vectors) and non-integrating plasmids for gene expression. Since no integration of 
plasmid sequences into host chromosomes has been reported in vivo to date, they should 
neither activate oncogenes nor inactivate tumor suppressor genes. This built-in safety with 
non-viral systems contrasts with the risks associated with the use of most viral vectors. As 

25 episomal systems residing outside the chromosomes, plasmids have defined 
pharmacokinetics and eliDoination profiles, leading to a finite duration of gene expression 
in target tissues. 

Formulating the nucleic acid with non-ionic and anionic polymers is particularly 
desirable because the polymers enhance transfection and expression of the nucleic acid, 
30 protect the nucleic acid from degradation, and are biodegradable. In addition, because 
formulating the nucleic acid with non-ionic anionic polymers results in more efficient 
transfection, lower amoimts of DNA may be used. By biodegradable, it is meant that the 
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polymers can be metabolized or cleared by the organism in vivo without any or minimal 
toxic effects or side effects. The term "anionic polymers" means polymers having a 
repeating subunit which includes, for example, an ionized carboxyl, phosphate or sulfate 
group having a net negative charge at neutral pH. Examples of the anionic polymers 

5 include poly-amino acids (such as poly-glutamic acid, poly-aspartic acid and combinations 
thereof), poly-nucleic acids, poly-acrylic acid, poly-galacturonic acid, and poly-vinyl 
sulfate. In the case of polymeric acids, the polymer will typically be utUized as the salt 
form. Examples of other polymers include PVP, PVA, chitosan, etc. 

The term "poly-L-glutamic acid" is used interchangeably herein with "poly-L- 

10 glutamic acid, sodium salt", "sodium poly-L-glutamate" and "poly-L-glutamate." "Poly- 
L-glutamate" refers to the sodium salt of poly-L-glutamic acid. Although the L 
stereoisomer of polyglutamic acid has been primarily employed, the other stereoisomer or 
racemic mixtures of isomers are within the scope of the invention. The present invention 
contemplates that other salts of anionic amino acid polymers may be equally suitable. 

IS The term "anionic amino acid polymers" means polymeric forms of a given anionic 

amino acid such as, for example, poly-glutamic acid or poly-aspartic acid. The present 
invention contemplates that polymers formed of a mixture of anionic amino acids, such as 
for example glutamic acid and aspartic acid, may be equally suitable. 

Efforts have been made to enhance the delivery of plasmid DNA to cells by 

20 physical means including electroporation, sonoporation and pressure. Injection by 
electroporation is a modem technique that involves the application of a pulsed electric 
field to create transient pores in the cellular membrane without causing permanent damage 
to the cell and thereby allows for the introduction of exogenous molecules. This technique 
has been used widely in research laboratories to create hybridomas and is now being 

25 applied to gene transfer approaches for therapy. By adjusting the electrical pulse 
generated by an electroporetic system, nucleic acid molecules can find their way through 
passageways or pores in the cell that are created during the procedure. U. S. Patent 
5,704,908 describes an electroporation apparatus for delivering molecules to cells at a 
selected location within a cavity in the body of a patient. (U.S. Patent 5,704,908, 

30 including any drawings contained therein, is hereby incorporated by reference as if fully 
set forth herein.) 
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The tenn "pulse voltage device", or "pulse voltage injection device" as used herein 
relates to an apparatus that is capable of causing or causes uptake of nucleic acid 
molecules into the cells of an organism by emitting a localized pulse of electricity to the 
cells, thereby causing the cell membrane to destabilize and result in the formation of 
. 5 passageways or pores in the cell membrane. It is understood that conventional devices of 
this type are calibrated to allow one of ordinary skill in the art to select and/or adjust the 
desired voltage amplitude and/or the duration of pulsed voltage and therefore it is expected 
that future devices that perform this function will also be calibrated in the same manner. 
The type of injection device is not considered a limiting aspect of the present invention. 

10 The primary importance of a pulse voltage device is, in fact, the capability of the device to 
facihtate delivery of compositions of the invention into the cells of an organism. The 
pulse voltage injection device can include, for example, an electroporetic apparatus as 
described in U.S. Patent 5,439,440, U.S. Patent 5,704,908 or U.S. Patent 5,702,384 or as 
pubKshed in PCT W6 96/12520, PCT WO 96/12006, PCT WO 95/19805, and PCT WO 

15 97/07826, all of which are incorporated herein by reference in their entirety. 

EXAMPLE 1: Plasmid delivery and Induction, 

For some experiments, inducible EPO (mouse, rat or human) and 
GENES WITCH® plasnaids were mixed in a 1:1 ratio and formulated with 6 mg/ml 
20 sodium poly-L-glutamate (Sigma, St Louis, Missouri) and 0.15 M NaCl. The maxin^l 
plasmid concentration of the formulated material was 1.5 mg/ml. Formulated plasmids 
were delivered to mouse and rat hinfd-limb muscles with 1.5 and 2.0 cm^ size plate 
electrodes. Female C57BL/6 mice (18-20 g) were purchased from Harlan Sprague- 
Dawley. 

25 Plasmid mixtures in saline or polymers were injected into bilateral hind-limb 

muscles (150 microliters total volumie: 25 microliters per tibialis cranialis, 50 microliters 
per gastrocnemius) by direct injection. Two minutes later, the injected hind-limb muscles 
were subjected to electroporation using two stainless-steel parallel-plate-caliper electrodes 
connected to an Electro Square Porator™, Model T820, and BTX Enhancer™ 400 

30 oscilloscope (BTX Division of GeneTronics, Inc., San Diego CA). The electrodes were 
placed non-invasively in contact with the skin of the leg (plate separation distance of 3 or 
4 DMn) and two square wave pulses were applied at 375 V/cm (pulse duration 25 ms). 
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These electtoporation conditions yielded elevated levels of transgene expression and 
improved reproducibility. 

Mifepristone (Sigma) was either dissolved in sesame oil and administered in a 100 
microliter volume (intraperitoneally or orally) or implanted subcutaneously as a pellet 

5 using a 10-gauge Precision Trochar (Linovative Research of America, Sarasota, FL). 
Blood was collected by retroorbital bleeds. Serum samples were assayed for SEAP 
activity, hVEGF, or EPO protein levels by ELISA (R&D Systems). All procedures 
conformed to state and federal guidelines. 

For delivery to dog muscles, 1:1 mixture of plasmids formulated in poly-L- 

10 glutamate were injected, using 2 ml per site, into two (bilateral semimembranosus) or six 
(bilateral semimembranosus, semitendanosus and triceps) muscle sites of adult dogs (3-7 
kg in body weight). Two minutes following injection, each site was subjected to 
electroporation with a 6-needle electrode array that consisted of six 22G needles arranged 
in a circle of 1.0 cm diameter and 1 cm length. Electroporation conditions w^ 200 

15 V/cm, 6 pulses of 60 ms duration, with 1 second between pulses, and polarity was rotated 
following each pulse. The 6-needle electrode array, Electro Square Porator™, model 
T820, and Enhancer™ 400 Graphic Pulse Display were from BTX Division of 
Genetronics, Inc. (San Diego, California). 

20 Determination of Expression Levels 

Murine and rat blood was collected by retro-orbital methods. Caniiie blood was 
collected by veriipuncture of the jugular vein. Whole blood was used for hematocrit 
measurements and serum was used for EPO assays by the R&D Systems ELISA 
Quantikine kit (R&D Systems, Minneapolis, ^Dnnesota). 

25 Quantities of mEPO plasmid DNA in murine muscles were detennined by 

TaqMan® real time quantitative PGR (Applied Biosystems, Foster City, California) as 
described in Mahato, R.L et al.. Hum. Gene Ther. 9:2083 (1998). The forward primer was 
from the 5' untranslated region, and the reverse primer was from the murine EPO coding 
region. CMV-mEPO plasmid DNA was used to generate a standard curve. 

30 For measuring human EPO, an indirect ELISA technique using rhEPO (R&D 

Systems, Minneapolis, MN) as the target antigen was used according to Urra, J. M., et al., 
Clin. Chem. 43:848 (1997). 
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Test of significance between groups were performed by One-Way ANOVA 
analysis (SPSS Base 9.0, Chicago, Illinois). P values <0.05 were considered significant. 

EXAMPLE 2: Expression of Erythropoietin in a Regulated Expression System 
5 Experiments were conducted to assess the ability of various GENESWTTCH® 

plasmids to regulate the long-term expression of a transgene, in particular, the 
erythropoietin (EPO) gene. The first GENESWITCH® system tested (GENESWITCH® 
version 3.1 ("GSvS.l")), was comprised of the pGS1158 (CMV-GENESWITCH® v.3.1 
gene) and the pEP1442 (inducible murine EPO (mEPO) gene). The pGS1158 plasmid is 

10 comprised of a nucleic acid cassette having a CMV promoter driving the expression of a 
fusion protein; the fusion protein being comprised of a NFkappaBp65 transactivation 
domain amino acids 285-551 ("p65"), a 92 amino-acid yeast GAL-4 sequence specific 
DNA binding domain ("GAL42-93")» and a mutant progesterone receptor ligand-binding 
domain lacking a C-terminal 19 amino acids ("hPR640-9i4 LED"). The nucleic acid coding 

15 sequence for the GENESWITCH® v.3.1 protein is shown in Figure 7, SEQ. ID. NO. 12 
and the amino acid sequence in the Figure 8, SEQ. ID, NO. 14. 

A mixture of pEP1442 (coding for a codon optimized murine erythropoietin 
(mEPO) gene expressed fix>m the inducible 6xGAL-4/TATA promoter) and pGS1158 
(CMV-GSv.3.1, includes a CMV promoter, a SV40 intron located within the 3' UTR and 

20 a SV40 polyadenylation signal) was delivered to murine skeletal muscle with 
electroporation, and monitored for the inducibility of mEPO expression by oral doses of 
MFP. 

Fig. 1 shows the regulation of murine EPO expression in mice for 1 year using the. 
GENESWrrCH® version 3.1 expression system. Briefly, a 1:1 mixture of the pEP1442 

25 (inducible mEPO) and pGSllSS at total dose of 150 micrograms, was delivered with 
electroporation to both tibialis and gastrocnemius muscles of three mice on day 0. MFP 
(0.3 mg/Kg) was administered on the days indicated by the open triangles (A). Serum 
mEPO levels were measured 24 hours after single MFP doses or on the fifth day of 5 daily 
doses. Detectable mEPO levels are indicated by filled bars; undetectable levels by open 

30 squares (□). The level of mEPO on day 69 is lower than expected, because the serum 
sample was inadvertently collected after 72 hours. Hematocrit levels are indicated by 
fiUed diamonds (♦). All data points are mean ± SEM. Similar patterns of mEPO 



wo 02/24899 



PCT/USOl/30305 



29 

induction (total of 11 induction cycles) occurred at various times throughout the 1-year 
period in response to single or multiple doses of MPP. Quantitative PGR was also used to 
measure the amount of plasmid retained in muscles harvested on day 413. The average 
amount of mEPO plasmid per animal was 14±6 pg (approximately 4 million copies). 

5 As seen in Figure 1, administration of MFP on day 7 induced mEPO expression; 

serum levels of mEPO appeared within 24 hours, and retumed to undetectable levels 
within four days. Importantiy, and as is apparent from Figure 1, although long-term 
inducibility of mEPO expression in response to MFP dosing was achieved, regulation of 
hematocrit level was not achieved. The level of hematocrit did not diminish significantly 

10 during the 1-year period. For example, an increase in hematocrit level occurred prior to the 
first dose of MFP, and elevated levels of 60-80% were maintained regardless of when or 
how often mEPO expression was induced (Fig. 1). Thus, although mEPO was not 
detectable by ELIS A, the amount of EPO produced was nonetheless sufficient to induce a 
maximal rise in hematocrit level. 

IS Similarly, in in vitro experiments, significant transgene expression in the absence 

of mifepristone (MFP) was also observed, although the induction levels over background 
nonetheless exceeded 100-fold in some cell lines. Thus, these sensitive in vitro and in 
vivo experiments revealed that the EPO transgene was being expressed even in the 
absence of an inducer Q^FP), 

20 This data also suggest that the hematocrit level is a particularly sensitive indicator 

of basal expression of EPO due to the ability of EPO to induce changes in hematocrit even 
at EPO levels that are undetectable by ELIS A. For these leasons, efforts weie undertaken 
to tighten regulation of transgene expression using erythropoietin as the model for the 
development of an improved tightly regulated system. 

25 In one embodiment of the present invention, the tightness of regulated EPO 

expression was improved by modifying certain aspects of the GENESWITCH® and 
inducible EPO plasmids. The starting GENESWITCH® plasmid was eitiier the pGS1158 
or analogous plasmids that code for GENESWITCH® regulator protein v3.1, and contain 
a CMV promoter and has an SV40 intron located in the 3' untranslated region and has an 

30 SV40 poly(A) signal. Modifications, as will be discussed in succeeding examples 
included: (1) modifications to the transcribed but untranslated portions of the 
GENESWnCH® plasmid; (2) truncation of the GALA DNA binding domain; (3) use of 
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a muscle specific promoter in the GENESWITCH® plasmdd; and (4) modification of the 
cpre promoter on the inducible therapeutic gene plasmid. It should be noted that these 
modifications may be applied independently or in combination with the GENESWITCH® 
system. 

5 

EXAMPLE 3: Modifications to the Transcribed but Untranslated Portions of the 
GENESWITCH® Plasmid 

The transcribed portions of the GENESWITCH® plasmid were modified to 

10 include post-transcriptional elements (5' UTR including a synthetic intron and poly (A) 
signal) that could be expected to improve the level and fidelity of transgene expression. In 
a preferred embodiment of the present invention, both the molecular switch expression 
cassette and the therapeutic gene expression cassette include a synthetic intron. 

Cryptic splicing in transcripts from eukaryotic expression vectors is obviously 

15 undesirable. To obtain control over the splicing pattern and to maximize gene expression, 
suboptimal introns can be replaced by a strong intron. A synthetic intron with consensus 
splicing sequences should be optimal for this purpose. The synthetic intron of the present 
embodiment includes consensus sequences for (he S' splice site, 3' splice site and branch 
point. When incorporated into eukaryotic vectors designed to express ther^eutic genes, 

20 the synthetic intron will direct the splicing of RNA transcripts in a highly efficient and 
accurate manner, thereby minimizing cryptic splicing and maximizing production of the 
desired gene product. 

The first and sixth position of the 5' splice site consensus sequence are partially 
ambiguous. The 5' splice site pairs with Ul snRNA. The chosen sequence minimizes the 

25 free energy of helix formation between Ul RNA and the synthetic 5' splice site. 

5'ss 5' CAGGUAAGU 3* SEQ.ID.NO: 1 

lllllllll 

Ul RNA 3' GUCCAUUCA 5* SEQ.ID.NO: 2 

30 

In mammals, the branch point sequence is very ambiguous. The branch point 
sequence, except for a single bulged A residue, pairs with U2 snRNA. The chosen 
sequence minimizes the free energy of helix formation between U2 RNA and the synthetic 
branch point sequence. It also matches the branch point sequence that is obligatory for 
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yeast pre-mRNA splicing. The branch point is typically located 18-38 nts upstream of the 
3' splice site. The branch point of the synthetic intron is located 24 nts upstream from the 
3' splice site. 

5 

BP 5' UACUA^'C 3' SEQ.ID.NO: 3 

Mill I 

U2 RNA 3' AUGAU G 5' SEQ.ID.NO: 4 

10 The polypyrimidine tract of the consensus sequence for 3' splice sites is not exactly 

defined. At least 5 consecutive uracil residues are needed for optimal 3' splice site 
function. This concept is incorporated into the polypytinoidine tract of the synthetic 
intron, which has 7 consecutive uracil residues. 

Splicing in vitro is optimal when introns are >80 nts in length. Although many 

15 introns may be thousands of bases in length, most naturally occurring introns are 90-200 nt 
in length. The elemental structure of a synthetic intron according to the present invention 
(SEQ.ID.NO: 5) is shown in Figure 3 compared with italicized consensus sequences 
(SEQ.ID. 6). 

In one synthetic intron embodiment of the present invention, IVS8, the length of 
20 the synthetic intron is 118 nucleotides. The sequence of IVS8, (SEQ.ID.N0:7), is shown 
in Figure 4. Exonic sequences are in boldface. N = any base. Consensus splicing signals 
are double-underlined. Restriction enzyme recognition sites are over-lined. The 
restriction enzyme Bbsl may be used to cleave the DNA precisely at the 5' splice site, and 
Earl may be used to cleave the DNA precisely at the 3' splice site. The two restriction 
25 sites, Bbsl and Earl, located within the synthetic intron, permit the intron to be easily and 
precisely deleted. The PstI and Nhel sites are included to facilitate the verification of 
cloning procedures. Double-stranded DNA with this sequence may be prepared using 
mutually priming long oligonucleotides. 

To more closely match the stmcture of naturally occurring genes, which typically 
30 contain many introns, the synthetic intron may be inserted into the gene of interest at 
multiple locations. When multiple introns are inserted, however, care must be taken to 
ensure that the lengths of resultant internal exons are less than 300 nucleotides. If internal 
exons are greater than 300 nucleotides in length, exon skipping may occur. 
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In one embodiment, the expression cassette was further modified to introduce a 
CMV 5' UTR, termed UT12 (SEQ. ID. N0:8) in addition to the synthetic intron, IVS8 
(SEQ.ID.NO:7), within the 5' UTR. The SV40 polyadenylation signal was replaced with 
a human growth hormone ("hGH") poly (A) signal. 
5 These modijBcations, or other expression cassette modifications known to those of 

skill in the art, may be employed to generally increase the level and fidelity of transgene 
expression from plasmid and viral vectors. 

EXAMPLE 4: Truncation of the GAL-4 DNA Binding Domain 

10 The GALr4 DNA binding domain binds as a dimer to the palindromic 17-mer 

GAL-4 DNA binding site (CGGAAGACTCTCCTCCG). The Kd for binding of GAL-4, 
residues 1-100, is 3 nM (Reece and Ptashne (1993) Science 261: 909-911). Thus, in order 
to bind to the GAL-4 promoter and activate transcription of the inducible EPO plasmid, it 
is contemplated that a GENESWTTCH® regulatory protein having a GAL-4 DNA binding 

IS domain is required to form a homodimer. In the presence of the inducer, MFP, binding of 
MFP to the mutated hPR LBD may trigger a conformational change in the protein so as to 
initiate dimerization. 

As discussed in Example I, however, increases in hematocrit levels were observed 
in the presence of the GENESWITCH® v.3.1 protein even without the MFP ligand, 

20 indicating that the GENESWITCH® v3.1 protein may be able to dimerize and bind the 
GAL-4 binding site and induce transgene expression in the absence of MFP. Efforts were 
undertaken to reduce this drug-independent induction of expression by considering the 
tertiary structure of GAL-4 using molecular modeling and designing truncation mutant 
having lower dimerization potential but retaining sequence-specific DNA binding activity. 

25 Figure 5 depicts the structure of the GAL-4 protein DNA binding domain, residues 

1-93 SEQ. ID. NO. 10; residues 2-93 of which has been incorporated into the 
GENESWITCH® version 3.1 plasmids (correlating to the underlined nucleic acid 
sequence in Figure 7, SEQ. ID. NO. 12, and amino acid sequence in Figure 8, SEQ. ID. 
NO. 14). The DNA recognition unit (residues 9-40) is boxed, with the cysteine (C) 

30 residues involved in chelating zinc shown in bold. The coiled-coil stmctures that form the 
dimerization elements (residues 54-74 and 86-93) are also boxed, with the generally 
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hydrophobic first and fourth positions of each heptad repeat sequence shown in bold. 
Residue Ser 47 and Arg 51, which form an H-bond between chains, are marked by carats. 

The first seven residues of the GAL-4 DNA binding domain are disordered and are 
not known to contribute any function, while residues 8-40 form the Zn binding domain or 
5 the DNA recognition unit. This unit has two alpha helical domains that form a compact 
globular structure and in the presence of Zn resulting in a structure that is a binuclear 
metal ion cluster rather than a zinc finger, i.e., the cysteine-rich amino-acid sequence 
(Cys^^-Xaa2-Cys^^-Xaa6-Cys^^-Xaa6-Cys^^-Xaa2-Cys^^^Xaa6-Cys^^> binds two Zn(II) ions 
(Pan and Coleman (1990) PNAS 87: 2077-81). The Zn cluster is responsible for making 

10 contact with the major groove of the 3 bp at extreme ends of the 17-mer binding site. A 
proline at 26 (cis proline) forms the loop that joins the two alpha-helical domains of the 
zinc cluster domain and is also critical for this function. 

Residues 41-49 exist as a disordered linker that joins the DNA recognition unit and 
the dimerization elements (54-74 and 86-93). Once dimerized, residues 47-51 of 

15 dimerized subunits also interact with phosphates of the DNA target Residues 50-64 are 
contemplated to be involved in weak dimerization. They consist of a short coiled-coil that 
forms an amphipathic alpha-helix and wherein two alpha-helices are packed into a parallel 
coiled-coil similar to a leucine zipper. In addition to hydrophobic interactions of 3 pairs of 
leucines and a pair of valines found within residues 54-74, there are two pairs of arg-glu 

20 salt links, and H-bonds between Arg 51 of one monomer to Ser 47 of the other monomer. 
Residues 65-93 are contemplated to form a strong dimerization domain. The structure of 
residues 65-71 has not been fully determined, but it is most likely a continuation of the 
coiled-coil structure for one heptad repeat Residues 72-78 contain a proline and therefore 
disrupt the amphipathic helix. Residues 79-99, however, contain three more potentially 

25 alpha-helical heptad sequences (Marmorstein et al (1992) Nature 356: 408-414). 

There are a number of possible modifications that can be made to the regions of the 
GAL-4 domain as discussed above. Modifications in these regions may result in lower 
basal expression but still retain sequence-specific DNA binding. For example, the length 
of the region that contains the interacting coiled-coil sequences (residues 54-74 and 

30 residues 86-93) could be shortened by deletion such as deleting amino acid sequence 54- 
64, 65-74, 54-74, or 86-93. As such, GAL-4 mutants with only one coiled-coil region 
could be constructed by deleting one of the coiled-coil regions. In addition, mutant or 
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artificial sequences may also be used to replace the fragment GAL-4 domain with unique 
restriction sites positioned at key spots, for example at the junctions of each of the alpha- 
helical heptad sequences. Thus, modified versions of the GAL-4 protein domain could be 
produced that have progressively reduced alpha-helical heptad sequences. 

5 In one exemplary embodiment of the present invention, the GAL-4 domain was 

truncated by deletion of amino acids 75-93. This was achieved through the use of a 
convenient restriction endonuclease sites (Hinc n and Xho I) although other truncations 
may be produced according to molecular biology techniques known to those of skill in the 
art. By deleting the 72 bp XhoI-ffincH fragment from pGS1210, pGS1539 (CMV- 

10 GENESWrrCH® v.4.0) was generated, which has a 19 amino acid truncation at the C- 
terminal portion of the GAL-4 DNA-binding domain (the deletion corresponding to amino 
acid sequence 75-93 of SEQ. ID. NO. 10 in Figure 5). Figure 7 shows the nucleic acid 
sequences of the GENESWTTCH® coding region of both GENESWITCH® v.3.1 and 
v.4.0, SEQ. ID. NOS. 12 & 13, while Figure 8 shows the amino acid sequences of the 

15 same SEQ. ID. NOS. 14 & 15. Molecular modeling indicates that deletion of residues 75- 
93 removed the C-terminal helical portion without affecting the helical and coiled 
structure of the upstream amino acid sequences. 

Hgure 6 shows the effect of this C-terminal truncation of the GAL-4 DNA binding 
domain. COS-1 cells were transfected with pEP1556 (intact inducible promoter) and with 

20 either pGS1210 (CMV-GSv3.1) or pGS1539 (CMV-GSv4.0). When comparing the effect 
on EPO protein expression by GENESWITCH® v3.1 and v4.0 with the intact inducible 
promoter (pGS1210/pEP1556 vs pGS1539/pEP1556), a 5-fold increase (from 5X to 28X) 
in induction was observed mainly due to a decrease in basal EPO expression (57153 to 
9387 mlU/ml), brought about by the truncation of the GAL-4 DNA binding domain. 

25 It is notable that a 1-74 amino acid GAL-4 domain has been reported in a regulator 

protein chimera having a native estrogen-receptor ligand-binding domain and a VP- 16 
transactivation domain. Webster, J. et al., Cell 54:199 (1988). However, this construct 
was reported to have only 50% of the activity of a construct having amino acids 1 - 147 of 
the GAL-4 domain. In contrast, the present construct having GAL-4 (2-74) together with 

30 a mutated progesterone receptor ligand-binding domain was as active as a GAL-4 1-93 
construct in the presence of ligand while having lower background in the absence of 
ligand. 
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Figure 7 shows the nucleic acid sequences of the GENESWITCH® coding region 
of both GENESWITCH® plasmids v.3.1 (SEQ. ID. NO: 12) and v.4.0 (SEQ. ID. NO: 13), 
while Figure 8 shows the amino acid sequences of the same, GENESWITCH® proteins 
v.3,1 (SEQ. ID. NO: 14) and v.4.0 (SEQ. ID. NO: 15). As is apparent in Figure 8, the N- 
5 terminal methionine of the native GAL-4 sequence have been removed and a further eight 
amino acids have been added to the N-terminal end of the GENESWITCH® protein. 
Modifications to the N-terminal seven amino acids region are not of consequence as long 
as they do not affect the tertiary structure of residues 8-40 of the Zn binding domain. 

10 EXAMPLE 5: Modification of the Transgene Core Promoter 

The core region of the inducible promoter in the regulated EPO plasmid was also 
modified to reduce the basal expression of EPO without reducing the inducibility of EPO 
expression. This was desirable because increases in hematocrit level were observed when 
the original inducible mEPO plasmid was delivered by itself (in the absence of 

15 GENESWITCH® plasmid) to mice. This increase in hematocrit level was found to be 
dose dependent such that at higher plasmid doses, sufficient EPO was produced to result in 
an increase in hematocrit level from the EPO plasmid alone in the absence of the 
GENESWITCH® plasmid. It was, thus, desirable to develop a system in which the basal 
expression from the EPO plasmid was significantly reduced in order to increase reliance 

20 on administration of the inducer drug and provide an increased margin of safety by virtue 
of extrinsically controlled expression rather than through dependence on plasmid 
administration dose. 

The promoter of an inducible EPO plasmid usually contains 6X GAL-4 sites linked 
to a TATA box. (Figure 9A and B) Different numbers of GAL-4 sites may be employed 

25 and the optimal number may be determined empirically. Figure 9B depicts the nucleotide 
sequence of an inducible 6X GAL-4/TATA promoter (SEQ.ID.NO. 25). The six GAL-4 
elements (17 bp in length) are boxed, the TATA box (-29 to -24) is double-underlined, 
and the predicted transcription initiation site (+1) is marked by the arrow. The sequence 
from -33 to -22, which contains the TATA box, is from the Elb region of Adenovirus 

30 type 2 (residues 1665-1677 of NCBI accession no. JO 19 17). 

For this experiment, human, mouse or rat EPO coding sequences, adjusted for 
codon usage and removal of cryptic splice sites, were synthesized by Operon 
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Technologies, Inc. (Alameda, California) and cloned into plasmids with the inducible 6X 
GAL-4yTATA box promoter. One inducible munne EPO plasmid is pEP1442 having a 
core promoter shown schematically in Figure 9 A wherein relevant regions of the inducible 
promoter with unique restriction sites are indicated. The core promoter also consists of 6X 

5 GAL-4 sites linked to an Ad Elb TATA box joined to a CMV sequence that contains the 
putative initiator (inr) region of the CMV promoter (Macias et al., Joum. of Virol. 
70(6):3628 (1996)). Just downstream of this inr region is UTU (5' untranslated region 
(UTR) of CMV, +1 to +112). The high activity of this core promoter has been observed to 
result in drug-independent increases in the hematocrit level of mice. 

10 Utilizing unique restriction endonuclease sites engineered into the core promoter 

region, different regions of the promoter and 5' UTR were deleted to determine what 
effect, if any, this would have on overall expression of the transgene in the presence or 
absence of the GENESWTTCH® protein. Figure 9A indicates schematically the unique 
restriction endonuclease sites used to delete the TATA box (Sal I/EcoR V), inr (EcoR 

15 V/BsmB I) and part of the UT12 (Sac Il/Pac I). Figure 9C depicts the sequence of the 
inducible promoter, SEQ. E). NO. 17, with relevant regions and unique restriction enzyme 
sites boxed. BsmBI enzyme cleavage sites are indicated with arrows. "TP" refers to 
transcription factor binding site, and "inr" refers to initiator. 

To reduce the basal activity of the inducible promoter, a 30 bp BsmBI-EcoRV 

20 fragment (13-42 bp downstream from the TATATAAT box) was deleted. The resultant 
inducible mEPO, rEPO and hEPO plasmids were pEP1705, pEP1642 and pEP1596, 
respectively. In the case of the human EPO plasmid (pEP1596) having a deletion in the 
inr region. Figure 6 shows that the level of EPO expression from pEP1596 (73 mlU/ml) 
was 7 - lOX lower than with the level of EPO expression from pEP1556 (529 mlU/ml) 

25 (intact inducible promoter) in the absence of a GENESWITCH® plasmid. 

On the other hand, when the plasmid pEP1596 was co-transfected with the 
pGS1210 plasmid (GENESWITCH® v. 3.1), the inducibility of the EPO expression was 
increased from 5X inducibility to 7X inducibility. Hence, a deletion in the transcription 
initiation region (inr) of the inducible EPO plasmid significantly reduced the intrinsic 

30 activity of the promoter without impairing, and may even enhance, its ability to be 
induced. 
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EXAMPLE 6: Combination of Modifications to the Truncated GAL-4 DNA-binding 
domain and the modified Transgene Core Promoter 

Figure 6 also shows that when the modifications to the truncated GAL-4 DNA- 
5 binding domain and inr-deleted transgene core promoter were combined, there was an 
even higher level of expression inducibility (37X inducibility for pGS1539 and pEP1596) 
compared to the truncated GAL^ DNA binding domain alone (28X inducibility for 
pGS1539 and pEP 1556) and the inr-deleted transgene core promoter alone (7X 
inducibility for pGS 1210 and pEP1596). Again, this was attributed to a further decrease 
10 in basal EPO expression (pGS1539/pEP1596 level of 4650 mlU/ml in the absence of 
MFP), 

EXAMPLE 7: Muscle Specific Promoter in the GENESWITCH® Plasmid 

Where expression in a particular tissue is desired, strong non-tissue specific 

15 promoters may be replaced with tissue specific promoters. For example, if the target 
tissue for gene expression is muscle, an actin promoter may be employed. Several 
advantages may be gained through the use of tissue-specific promoters. In a particular 
tissue, such as for example, muscle tissue, use of muscle-specific promoters may increase 
the fidelity of expression. Tissue-specific promoters may be expected to decrease the 

20 potential for unscheduled gene expression in non-target tissues. In particular, tissue- 
specific promoters may provide the advantage of reduced expression in dendritic and other 
antigen presenting cells, thus avoiding immune responses to the expressed proteins. In 
certain circumstances, a low level of regulator plasmid expression may also be desirable. 
In a combination plasmid system, it is also preferable to regulate the level of transgene 

25 expression by inherent properties of the plasmid delivered rather than by attempting to 
variably titrate the dose of plasmid delivered. 

As the level of basal expression without MFP was found to increase with 
increasing plasmid dose, it was desirable to reduce the amount of GENESWITCH® 
protein .made by utilizing a weaker promoter. la muscle tissue for example, the avian 

30 skeletal alpha-actin promoter results in less gene expression that the CMV promoter. In 
one embodiment, the CMV promoter of the GENESWITCH® plasmid was replaced with 
a muscle-specific promoter, avian skeletal alpha-actin promoter (SK promoter) (SEQ. ID. 
NO. 16) to reduce the level of regulator protein produced. The avian skeletal alpha-actin 
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promoter is described in US Patent No. 5,298,422, incorporated herein by reference in its 
entirety. 

In another embodiment, the muscle-specific avian skeletal alpha-actin promoter 
was combined with an optimized arrangement of post-transcriptional elements (5* 
5 untranslated region (UT12), syndietic intron (TVS 8) and poly(A) signal (hGH pA)). 

As depicted in Figure 2, the improved plasmids (pGS1633, pGS1694) code for 
GENESWrrCH® regulator protein v.4.0 constructed by deleting about 20 C-terminal 
residues of the GAL-4 region (indicated by the inverted triangle in Figure 2) together with 
replacement of the CMV promoter with a muscle-specific promoter. Muscle-specific 
10 GENESWirCH® v.4.0 constructs (pGS1633 and pGS1694, identical except for minor 
variations in plasmid backbone) were generated by replacing the CMV promoter with the 
chicken skeletal alpha-actin promoter (base pairs -420 to -1), 

Use of the muscle specific promoter for the molecular switch expression cassette 
resulted in lower basal expression in the absence of inducer and higher level with 
15 induction. With the muscle-specific system (pEP1596/pGS1633), hEPO expression was 
regulated in a manner that was strictly dependent on the inducing drug, MFP. No hEPO 
expression or increase in hematocrit level was detected in the absence of MFP treatment, 
and when MFP was administered, hEPO was induced to high levels (>3Q0 mU/mL), which 
caused hematocrit to increase from 45% to 60%. In contrast, with the CMV- 
20 GENESWrrCH® system (pEP1596/pGS1539), hEPO expression was regulated in a 
manner that was significantly less robust. Low levels of hEPO expression and increases in 
• hematocrit level occurred in the absence of MFP treatment, and when MFP was 
administered, the level of induced hEPO expression was modest (-50 mU/mL). 

25 EXAMPLES 

In a preferred embodiment that combines the various improvements discussed 
above, a muscle-specific GENESWTTCH® plasmid having a truncation of GAL-4 DNA 
binding domain (DBD2-74) (pGS1694) may be used together with a regulated human EPO 
expression plasmid having the inr deletion (pEP1666). The GENESWITCH® regulator 
30 protein plasmid may utilize an a-actin muscle specific promoter upstream of the regulator 
protein coding sequences that include a truncated GAL-4 domain, a mutated progesterone 
receptor having a C-terminal deletion of 19 amino acids and a p65 transactivation domain. 
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A map of an exemplary plasmid having this construction, SK GENESWTTCH® plasmid 
version 4.0 (pGS1694) is shown in Figure 11 with the corresponding sequence on Figure 
13 SEQ. ID. NO. 22 (the GAL-4 sequence is bold in italics, the mutated steroid binding 
domain is underlined, and the p65 transactivation domain is in bold). A map of an 
5 exemplary regulated human-EPO plasmid, pEP1666 is shown in Figure 12 SEQ. ID. NO. 
23 with the corresponding sequence on Figure 14 (EPO sequence in italics). 

Figure 15 depicts the results of administering a 1:1 mixture of the improved 
plasmids, pEP1705 (inducible mEPO with inr deletion) and pGS1694 (muscle-specific 
GENESWrrCH® v.4.0) at a total dose of 150 micrograms, which was formulated with 

10 sodium poly-L-glutamate and delivered with electroporation to both tibialis and 
gastrocnemius muscles of mice on day 0. Beginning on day 25, oral MFP (A) was 
administered chronically (daily on weekdays) at doses of 0.0 (0), 0.01 mg^g (•), 0.03 
mg/kg(A)or0.10mg/Kg(B). Blood samples were analyzed for hematocrit level. Each 
group contained five animals and data are expressed as mean ± SEM. 

15 In the absence of MFP treatment, serum levels of mEPO were undetectable and no 

significant increases in hematocrit level occurred in the animals from day 0 to day 25 (Fig. 
15). Thus, the improvements to the mEPO/GENESWITCH® system sufficiently reduced 
the level of basal expression, in the absence of an inducer, even when the plasmids were 
deUvered to mice at a high dose (7.5 mg/Kg body weight). Starting at day 25, when oral 

20 doses of MFP were administered for 6 weeks, induction of stable, statistically significant 
increases in mEPO expression and hematocrit levels were observed. Induced levels of 
mEPO were barely detectable at the lowest MFP dose (0.01 mg/Kg), but hematocrit levels 
gradually increased from 50 to 61% (Figure 15, (•) from day 25 to about day 37). This 
11-point increase, which was significant compared to the control group, was also 

25 maintained during the course of MFP treatment. 

At MFP doses of 0.03 or O.lOmg/Kg, induced levels of mEPO exceeded 20 
mU/mL and hematocrit levels increased to 73-78%. (Figure 15, (A) and (■) about day 
37 and day 49) These hematocrit levels were significantly higher than those of animals 
treated with 0.01 mg/Kg (•). When MFP dosing was ceased, niEPO expression 

30 immediately returned to undetectable levels and hematocrit levels began to decline. A 
complete return to baseline levels was observed to require at least 30 days. To summarize, 
strict, ligand-dependent regulation of hematocrit levels in mice following delivery of the 
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improved mEPO/GENESWITCH® plasmids was obtained in contrast to the unregulated 
hematocrit levels as discussed in Example I in conjunction with Figure 1. Furthermore, 
increases in hematocrit levels have been shown to be controlled by adjusting the dose of 
the orally administered ligand. 

5 

EXAMPLE9 

Inmiune responses resulting from administration of a gene encoding a foreign 
species protein can be analogized to immune responses occurring in humans receiving 
replacement therapy for a genetic defect characterized by absence of a normal functional 

10 protein such as the hemophilias. This scenario was modeled by administration of 75 mg 
of pEPlS49 (CMV-human EPO) formulated with sodium poly-L-glutamate and delivered 
with electroporation to both tibialis and gastrocnemius muscles of five mice. Figure 17A 
demonstrates levels of hEPO expression exhibited by 5/5 mice (mean ± SEM). Figure 
17B demonstrates hematocrit levels exhibited by 5/5 mice (mean ± SEM). As shown in 

15 Figure 17, when a human protein such as human EPO was produced in mice constitutively 
using a CMV-hEPO plasmid administered to muscle with electroporation, all animals 
developed erythroid hypoplasia, characterized by subnormal hematocrit, loss of hEPO 
expression and the appearance of anti-hEPO antibodies. 

The abijUty of a regulated system to control inamune responses to a foreign 

20 transgene.was explored. Induction of gene expression was delayed for various periods 
following gene administration. The results of long-term regulation of a foreign transgene 
(human EPO) in mice are shown in Figure 18. A 1:1 mixture of pEP1596 (inducible 
hEPO having deletion of the inr region) and pGS1633 (muscle-specific GENESWITCH® 
v.4.0 having a truncation in the GALr4 domain), total dose of 150 micrograms, was 

25 formulated with sodium poly-L-glutamate or saline and delivered with electroporation to 
both tibialis and gastrocnemius muscles of five mice. Each formulation group (saline or 
poly-L-glutamate) was then subdivided into two groups, where one group received its first 
mifepristone (MFP) dosing on day 13, and the other group received it on day 55. All MFP 
dosing regimens consisted of five consecutive days of oral administration of a 0.33 mg/Kg 

30 dose in sesame seed oil. After the initial MFP dosing, all animals were given MFP on the 
following days: 105-109, 154-158, 210-214. Finally, on day 268 a slow release MFP 
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pellet was implanted subcutaneously into all remaining animals. hEPO protein levels were 
measured four days after the first MFP dose and hematocrit levels were measured weekly. 

The saline formulated hEPO/SK-GS system was induced four times over a 270-day 
period by MFP. The hEPO protein level, in the 12-day lag group, was induced to -300 
5 mU/ml after each induction. Three of the five animals showed elevated hematocrit levels 
(increased from 45 to -55%) after all four inductions, while the remaining two animals in 
this group had elevated hematocrit levels after just the first and second inductions. The 
hEPO protein levels in the 54-day lag group were also induced to -300 mU/ml after each 
induction. However, four out of five animals had elevated hematocrit levels (-55%) after 

10 all four inductions. The remaining animal in this group responded to just the first two 
inductions. All three of the non-responder animals in these two groups went anemic 
(hematocrit level 5% below normal) sometime after the second induction. Very low levels 
of antibodies to hEPO (OD450 values <0.2S) were detected in all ten animals in these 
groups on days 109 and 113. 

15 The poly-L-glutamate formulated hEPO/SK-GS system was induced four times 

over a 270-day period by MFP. The hEPO protein level, in the 12-day lag group, was 
induced to -400 mU/ml after each dosing. Two of five animals showed elevated 
hematocrit levels (-60%) after all four inductions. Two of the remaining animals 
responded to just the first induction, while the final animal in the group responded to the 

20 first two inductions. The hEPO protein levels in the 54-day lag group were induced to 
-250 mU/ml after each dosing. Following rounds of MFP dosing initiated on days 104, 
154 and 210, hEPO levels increased to 249±51, 272±59 and 322±81 mll/mL, respectively, 
and hematocrits increased to 59+0.5%, 58±0.9 and 61±1.3%, respectively (Fig. 18). Each 
data point shown is the mean ± SEM of 4/5 animals. Figure 18A depicts levels of hEPO 

25 expression exhibited by 4/5 mice (mean ± SEM). Figure 18B depicts hematocrit levels 
exhibited by 4/5 mice (•, mean ± SEM) and one individual animal, (0), in which 
antibodies to the foreign EPO developed. 

Anemia was observed after the second round of inductions in all the non- 
lesponders in the two groups. A significant antibody response to hEPO was detected in 

30 only one animal fiom each group on day 113, and both animals went anemic after the 
second induction. 
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I 

On day 268, all of the animals that received the poly-L-glutamate formulated 
hEPO/SK-GS system, were administered a MFP pellet by subcutaneous implantation. 
Two of the 12-day lag animals responded to the pellet and achieved hematocrit levels of 
-75%. One animal had no change in hematocrit level, and the final two animals were 

5 hypocythemic. Three of the 54-day lag. animals responded to the pellet and achieved 
hematocrit levels of -75%. The hematocrit of one animal appeared to be declining 
towards anemia (after an initial increase in hematocrit to -65%) when the experiment 
ended, and there was no effect on the remaining animal that was already hypocythemic. 

These studies showed that when 1:1 (w/w) mixtures of inducible human EPO and 

10 muscle-specific GENESWITCH® plasmids (pEP1596/pGS1633 or pEP1666/pGS1694) 
were delivered to the tibialis and gastrocnemius muscles of C57BL/6 mice at doses as hi^ 
as 150 • g (-7.5 mg/kg body weight), no hEPO expression was detected and no increases 
in hematocrit occurred in the absence of MFP dosing. In one study, the lack of basal 
expression in the period between plasmid delivery and induction of hEPO expression was 

15 confirmed for 55 days. The lack of "leaky" expression in mice at plasmid doses of -7.5 
mg/kg body weight predicts that a sinailar lack of "leaky" expression will occur when 
plasmids axe delivered to humans at doses of 0.5 mg/kg body weight (the highest proposed 
human dose) or less. 

In summary, although immune responses to the hEPO protein occasionally 
20 occurred when hEPO/GENESWITCH® plasmids were delivered to mice, neutralizing 
responses were more moderate and less frequent In one study, neutralizing immune 
responses, characterized by the loss of inducible hEPO expression, a decline in hematocrit 
to subnormal levels, and the appearance of elevated levels of anti-hEPO antibodies, 
occurred in only 45% of the animals, and 65% retained the ability to induce hEPO 
25 expression and hematocrit for more than 8 months. In contrast, expression of CMV-hEPO 
leads to neutralizing immune responses in 100% of the animals. The tighdy regulated 
gene switch system disclosed herein permits is the imposition of a lag time between 
plasmid delivery and induction of hEPO expression. 

DeUvery of plasmids with electroporation is associated with transient damage to 
30 muscle fibers and inflammatory cell infiltrates. Thus, plasmids with constitutive 
promoters win express foreign transgene products in an inflammatory environment that is 
highly immunostimulatory. On the other hand, if transgene expression is delayed to a time 
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when inflammatory responses have subsided, as is done with the hEPO/GENESWlTCH® 
system, immune responses appear to be lessened considerably. 

Thus, by increasing the length of the lag period (time before inducing the foreign 
transgene) from 12 to 54 days, the incidence of erythroid hypocythemia and anti-hEPO 
5 antibody production can be decreased. 

EXAMPLE 10: Hematocrit regulation in rats in the improved system following delivery of 
low plasmid doses 

The next set of experiments was performed to determine the reproducibility of the 

10 mice results in larger animals such as a rodent Hence, plasmids were formulated with 
sodium poly-L-glutamate and delivered to the gastrocnemius muscles of rats with 
electroporation. Fischer female rats (250 g) were obtained from Harlan Sprague Dawley 
(Raleigh, North Carolina), Mifepristone (Sigma, St. Louis, Missouri) in sesame oil was 
administered to the rats (150-400 microliters, intraperitoneal). 

15 Figure 16A shows the results of an experiment using a constitutively expressing rat 

EPO (rEPO) expression plasmid. At day 0, 1:1 mixtures of pEP1641 (CMV-rEPO) and 
pEP1642 (inducible rEPO, added as filler) were formulated with sodium poly-L-glutamate 
and were administered at doses of 7.5 (A) or 25 micrograms (A) per animal and the 
hematocrit levels of the rats were monitored. As a control, naive animals (X) received no 

20 plasmid treatment. Following delivery of CMV-rEPO plasmid at doses of 7,5 or 25 
micrograms, hematocrit levels increased within 7 days and rapidly reached levels that 
exceeded 80%. rEPO expression was maintained for at least 91 days at 9±3 and 28±16 
mU/mL at the lower and higher plasmid doses, respectively. Hence, uncontrolled 
polycythemia resulted from the delivery of a constitutively expressed EPO plasmid even at 

25 a dose as low as 0.025 mg/Kg. This data emphasize the need for a tightly regulated EPO 
transgene expression system. 

Figure 16B depicts an improved regulated EPO expression system in rats 
according to one embodiment of the present invention. Using electroporation, 1:1 
mixtures of pEP1642 (inducible rat EPO) and pGS1694 (muscle-specific 

30 GENESWrrCH® v.4.0) were administered at doses of 7.5 (0), 25 (•), or 75 (□) 
micrograms per animal to rat gastrocnemius muscle at day 0. The days when MFP were 
administered to the animals are indicated as open triangles (A) at the X-axis of Figure 
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16B. For the first induction cycle, MFP was adtadnistered on days 18 - 22 at 1 mg/Kg, and 
on days 25 - 29 at 5 mg/Kg. For the second induction cycle, MFP was administered at 1 
mg/Kg on each weekday, for 6 weeks, beginning on day 73. Blood samples were assayed 
for hematocrit levels. Each group contained five animals and data are expressed as mean 

5 ± SEM. To induce rEPO expression, we administered MFP at a relatively high dose (1-5 
mg/Kg), since it is cleared rapidly in rats (half-Ufe of 1 hr, Deraedt, R. et al. in The 
Antiprogestin Steroid RU486 and Human Fertility Control, E. E. Balieu, S. Segal, Eds. 
103-122, Plenum Press, New York, 1985). 

During the 17 days immediately after plasmid delivery, no significant change in 

10 hematocrit level occurred in contrast to the first 25 days shown in Figure 16A. When 
MFP was orally administered for two weeks starting on day 18, hematocrit levels 
increased following administration and peak hematocrit levels were observed to have 
occurred on day 33, four days after the final MFP dose in the first induction cycie. 
Unexpectedly, the lowest plasmid dose (0.025 mg/Kg (0)) yielded a hematocrit response 

15 that was indistinguishable from that achieved by the highest plasmid dose (0.8 mg/Kg (□ 
)). Hematocrit values of all treated groups were significanfly different than naive groups 
(X) and remained elevated for approximately 1 month before returning to baseline levels. 

When MFP was re-administered at a lower dose (1 mg/Kg for 4 weeks, beginning 
on day 73), re-induction of hematocrit levels was obsefved. Peak hematocrit levels 

20 occurred two weeks after the completion of MFP dosmg, and then began to decline. At 
the higher plasmid doses, the increases in hematocrit level were significant compared to 
naive animals. However, at the lowest plasmid dose, where a more modest response was 
observed, the increases in hematocrit level did not achieve statistical significance. This 
data demonstrate that durable, ligand-dependant regulation of hematocrit levels can be 

25 achieved in rats by low plasmid doses of the improved regulated EPO expression system. 

EXAMPLE 11: Regulation of human EPO expression in larger animals 

To further assess the applicability of the improved EPO/GENESWITCH® system 
in larger animals, tiie plasmid-based hEPO/GENESWITCH® v.4,0 system was tested in 
dogs. On day 0, 1:1 mixtures of pEP1596 (inducible hEPO with an inr-deleted region) and 
30 pGS1633 (muscle-specific GENESWITCH® v.4.0) were formulated witii sodium poly-L- 
glutamate, injected into 2 or 6 skeletal muscle sites of dogs, and then subjected to 
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electroporation with a 6-needle electrode array. The use of the needle electrode device 
facilitates electroporation of muscles in larger animals. In one case, 6 mg of formulated 
plasmid mixture was delivered to two sites (semimembranosus muscles) per dog (n=3). In 
another case, 18 mg of formulated plasmid mixture was delivered to six sites (triceps, 

5 semimembranosus, semitendanosus muscles) per dog (n=4). Six days following plasmid 
delivery, MFP (1 mg/kg) was administered orally and serum hEPO levels were measured. 

During the 6 days following delivery of the 6 mg plasmid dose but before the 
administration of MFP, hEPO expression was undetectable. Following two days of oral 
MFP dosing, however, an induction of serum hEPO was observed. Four days after 

10 cessation of the 3-day MFP treatment, hEPO levels dropped to near baseline levels. When 
MFP was re-administered on days 12-17, hEPO levels were again induced to similar levels 
as before. A similar pattern of expression was also observed using an 18 mg plasmid dose 
except that the induced levels of hEPO were 4 to 6-fold higher. Thus, the 
hEPO/GENESWrrCH® system functions effectively in a larger animal. 

15 However, no increase in hematocrit level was observed in any of the dogs, fiistead, 

anti-hEPO antibodies appeared in all dogs by 3-4 weeks, which apparently neutralized the 
biological response. The appearance of antibodies may also account for the decline in 
hEPO levels that occurred in the midst of the second period of MFP dosing. This data 
showing the presence of neutralizing immune responses in all of the dogs are in contrasts 

20 with the lack of neutralizing immune responses in most of the mice treated with the 
hEPO/GENESWrrCH® system (Fig. 18). 

In addition to the difference in species, one other factor may contribute to the 
difference in the results in dogs and mice. In dogs, the MFP treatment was initiated after 
only six days fix)m the day the plasmids were introduced into the animal. In contrast, MFP 

25 treatment was initiated in mice 55 days after the plasmids were introduced. As discussed 
in the experiments with mice, increasing the lag time between the time the plasmids are 
introduced and the start of MFP dosing significantly decrease the occurrence of erythroid 
hypoplasia. Thus, the length of tune between plasmid delivery and induction of transgene 
expression may be a critical factor in preventing immune response to the foreign 

30 transgene. It is desirable then to lengthen the delay between the introduction of the 
plasmids and the start of the MFP to at least 12 days. More preferably, the lag period 
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should be about 20 days (as shown in rats with Figure 17B), and even more preferably to 
about 55 days as shown in mice (Figure 18B). 



EXAMPLE 12 

Table 1 below describes the fundamental characteristics of the various test 
plasmids used in the development of a tightly regulated expression system while the 
nucleic acid sequences of the inducible promoter regions of the plasmids are set out in 

10 Figure 10 (pEP1556 - intact inducible promoter region: TATA box/initiator region 
(inr)/UT12 transcription factor binding site (UT12-TFBS)(SEQ. ID. N0.18); pEP1595 - 
Sall/EcoRV deletion of the TATA box promoter region:inr/UT12-TFBS (SEQ. ID. 
N0.19); pEP1596 - EcoRV/BsmBI deletion of the inr region: TATA box/UT12-TFBS 
(SEQ. ID, NO.20); pPE1597 - SacII/PacI deletion of the UT12 region: TATA box/inr 

15 (SEQ.ID.N0.21). 



Table 1: Plasmids used in testing the expression plasmid modifications 



Name 


Promoter 


GALA 
domain 


Promoter 

modificaition 


Transgene 


pEP1556 


6xGAL- 
4/rATA 




none 


hEpo 


PEP1595 


6xGAL4 




deleted TATA box 


hEpo 


pEP1596 


6xGAI^ 
4yTATA 




deleted inr region 


hEpo 


pEP1597 


6xGAI^ 
4yTATA 




deleted Un2 region 


hEpo 


PEP1549 


CMVenh/pro 




none 


hEpo 


pGS1210 


CMVenh/pro 


2-93 


none 


GENESWirCH® 
v3.1 


pGS1539 


CMVenh/pro 


2-74 


none 


GENESWirCH® 
v4.0 


pGS1382 


4xGAL^tk 


2-93 


none 


GENESWirCH® 
vS.l 


pGS1598 


4xGAL4/tk 


2-74 


none 


GENESWTTCH® 
v4.0 



The resultant inducible mEPO, rEPO and hEPO plasmids are pEP1705, pEP1642 
20 and pEP1596, respectively. 
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EXAMPLE 13 

In another embodiment of the present invention, the improved molecular switch 
protein may have the amino acid sequence: MXJ [yGAL-4 2.74] / / [hPR 640-914] / Xn / 

5 [hP65 285-55i] as exemplified by SEQ.ID.NO:24 wherein Xn represents a series of amino 
acid sequence. X can be any amino acid and n can be any number but preferably between 
1-10. In another embodiment of the present invention, the improved molecular switch 
protein, encoded for example in GS v4.0 has the amino acid sequence MDSQQPDL / 
[yGAL-4 2.74] / DQ / [hPR 640-9141 / GST / [hP65 285.551] as exempUfied by 

10 (SEQ.ID.N0:15). Amino acids for the N-terniinus and linker regions are indicated by 
single letter abbreviations. Lidividual protem components are bracketed and the amino 
acid residues are specified in subscript yGAL-4 is the N-terminal DNA binding domain of 
the yeast GALr4 protein (GenBank accession no. AAA34626). hPR is a C-terminal 
truncated portion of the ligand binding domain of the human progesterone receptor 

15 (GenBank accession no. AAA60081), where hPR64o-933 is the full-length version of the 
hPR-LBD. hP65 is the activation domain of the p65 subunit of human NFkappaB 
(GenBank accession no. AAA46408). The human components of the GENESWITCH® 
regulator protein comprise 86% of its sequence. 

EXAMPLE 14: Expression of EPO Using Polymer Formulations and Electroporation 
20 Plasmid DNA/anionic polymer formulations were preferably prepared by 

aliquoting appropriate volumes of sterile stock solutions of plasmid, anionic polymer and 

5M NaCl to obtain selected final plasmid and anionic polymer concentrations. The 

anionic polymer was added to the DNA solution prior to adding salt for tonicity 

adjustment. Thus, poly-L-glutamate formulations are preferably prepared by combining 

25 an aqueous stock solution of sodium poly-L-glutamate (sodium salt of poly-L-glutamic 
acid) with a stock solution of purified plasmid DNA in saline or up to lOmM Tris, pH 7.5. 
After the poly-L-glutamic acid and DNA are combined, the solution is adjusted to a final 
concentration of 150mM NaCl by addition of a stock solution of 5M NaCl. 

The ability of poly-L-glutamate to increase the expression of a non-viral 

30 erythropoietin ("EPO") gene deliv^y was undertaken. Using quantitative polymerase 
chain reaction (qPCR) analysis, plasmid formulated in poly-L-glutamate resulted in at 
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least a log increase in levels of mEPO DNA compared with animals receiving a 
saline/DNA formulation. 

Animals received CMV-mEPO formulated either in 15-50 kDa poly-L-glutamate 
or in saline, Plasmid fonnulations were injected intramuscularly in each leg, 25 
5 microliters in each tibialis, 50 microliters in each gastrocnemius followed by 
electroporation 2 min after injection (375 V/cm (113 V/0.3 cm), 2 pulses, 25 msec pulse 
length. At defined time intervals, blood was collected by retro-orbital methods and 
hematocrit levels determined or the serum assayed for EPO levels. 

At indicated times, total muscle DNA was extracted and levels of were quantified 

10 by qPCR as follows: Plasmid DNA quantities in mouse muscles were determined by 
conducting TaqMan® real time quantitative PGR (Applied Biosystems, Foster City, CA) 
on isolated DNA samples as previously described (Mahato, R.L et al. Hum. Gene Ther. 
9:2083 (1998)). The primers used in the PGR were a forward primer, which primes in the 
5* untranslated region, and a reverse primer, which primes in the mouse EPO coding 

15 region. The probe sequence was located within the EPO gene. Purified CMV-mEPO 
plasmid DNA was used to generate a standard curve for the PCR assay. As shown in 
Figure 19, formulation in poly-L-glutamate results in a several fold increase in the amount 
of plasmid DNA that can be detected in tissues after electroporation. 

For mEPO expression determination, 75 mg CMV-mEPO plasmid DNA in 150 ml 

20 was delivered to C57BL/6 mice, 25 microliters per tibialis, 50 microliter per 
gastrocnemius. Plasmid was formulated in saline or 6 mg/mL poly-L-glutamate. Figures 
20 and 21 depict mEPO expression and Figure 20 also depicts the hematocrit level in mice 
following delivery of the mouse EPO gene by electroporation using saline and sodium 
poly-L-glutamate formulations. 

25 As shown in Figures 20 and 21, delivery in a poly-glutamate formulation results in 

considerably higher levels of expressed protein than when the plasmid DNA is delivered 
in saUne. Because a very small amount of erythropoietin is required to give a maximal 
increase in hematocrit, the induced hematocrit levels shown on Figure 20 do not differ 
between saline and polyglutamate formulations. However, because polyglutamate results 

30 in more efficient transfection, lower amounts of DNA can be administered using 
polyglutamate formulations. 



wo 02/24899 . PCT/USOl/30305 

49 

EXAMPLE 15: Increased duration of human factor DC protein expression following 
pulsatile expression using the GENESWITCH® system. 

Experiments were also performed to extend the improved tightly regulated 

5 transgene expression system to other therapeutic gene aside from EPO. Plasmids 
encoding human factor IX (hF.IX) and an improved muscle-specific GENESWITCH® 
plasmid, according to the present invention were mixed together in a 1:1 ratio and 
formulated with a polymer formulation (poly-L-glutamate, 6mg/ml). The plasmids at a 
concentration of 1 mg/ml were injected into the tibialis of the hind legs of female CD-I 

10 mice (25 microliters each muscle for a total of 50 microliters per animal). Following the 
injection the muscles were electroporated using caliper electrodes and applying 2 x 
^ 375V/cm square pulses for 25 msec each. 

In the constitutive expression group, plasmid encoding for hFJX under control of 
the CMV promoter was injected in a manner similar to first group of animals. Figure 22 

15 shows expression of hF.K driven by the CMV enhancer promoter in CD-I mice. 
Expression peaked 7 days after injection. Similar to data obtained by constitutively 
expressing EPO in mice, by 12-15 days after mjection hF.IX levels are back to baseline 
due to an immune response against hF.IX. 

In contrast. Figure 23 shows the regulated expression of hFJX in CD-I mice 

20 according to one embodiment of the present invention. Initially, hF.IX levels were 
undetectable for four weeks prior to MFP activation of the GENESWITCH® system. The 
first administration of mifepristone (filled triangles) was 28 days after injection of the 
plasmids. After MFP treatment, the plasma levels of hF.IX increased over time. Even 
after 20 days fix)m the start of MFP administration, the levels of hF.IX were still 

25 increasing. This was in contrast to the results using the CMV-hF.IX construct where 
expression was undetectable by 12 days after injection and electroporation. Furthermore, 
in the GENESWITCH® group, hF.IX levels were undetectable prior to activation of the 
system. After 4 weeks, hF.IX expression was activated by administration of mifepristone 
leading to detectable levels of hF.IX (Figure 23). 

30 In this particular example, removal of the drug for 4 days did not lead to a return to 

undetectable levels of hF.IX. Following two rounds of a pulsatile administration of 
mifepristone (3 days on and 4 days off) a more chronic administration protocol was 
adopted (5 days on and 2 days off). This led to increasing levels of hFJX expression that 
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reached a plateau level approximately 20 days after the initial mifepristone administration. 
Thus, Figure 23 shows hF.IX expression that has been maintained for approximately 35 
days which is clearly much longer than the duration of expression achieved in the 
constitutively active CMV group. 

5 The development of a neutralizing immune response was also dramatically reduced 

by the GENESWITCH® regulated hF.IX expression and was further reduced by 
lengthening the lag period between plasmid delivery and the iBrst induction of foreign 
transgene expression. The lag time necessary may be different with different delivery 
modalities. Delivery of plasmids with electroporation is associated with transient 

10 inflammation and cell infiltration. Inflammation may contribute to anti-hF.lX antibody 
production. The transient inflammation period following electroporation may last for 
approximately one month. Hence, it is preferred that foreign transgene expression be 
induced after inflammation at the muscle site has subsided. In addition, a pulsatile 
program of transgene expression may also reduce the neutralizing response by depriving 

15 the immune system of a source of antigen at certain critical times. 

EXAMPLE 16: Regulated IFN-alpha gene therapy using the plasnoid-based 
GENESWITCH® system 

20 For the GENESWITCH® regulated IFN system two plasmids were utilized. The 

GENESWITCH® regulator protein used was a chimera of . a truncated yeast GAL4 DNA 
binding domain (GAL4), the truncated human progesterone receptor ligand-binding 
domain and activation domain from the p65 subunit of human NF-kappaB (p65). The 
plasmid contains a muscle-specific promoter (avian skeletal a-actin promoter): The 

25 inducible plasnnd pIF1681 encoding for human IFN-alpha2b (hIFN-alpha2b) contains the 
coding sequence for hIFN-alpha2b (NM_000605) inserted into a plasmid backbone that is 
essentially the same as that of pEP1666. 

To induce expression, mifepristone in sesame oil was administered to mice 
(100ml) orally by gavage at a 0.3 mg^g dose. To prepare plasmid for injection, the 

30 plasmids were made in saline or formulated with 6 mg/ml poly-L-glutamate. In mice the 
injected volume of plasmid was 25 microliters into the tibialis and 50 microliters in the 
gastrocnemius muscle of the hindlimb. Following injection, caliper electrodes were 
positioned around the leg and compressed until snug (3-4 mm diameter between plates). 
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Two 25 millisecond pulses at a voltage of 375 V/cm were then administered with a 700 
millisecond delay between pulses using a T-830 Electro Square Porator (Genetronics, Inc., 
San Diego, CA). Colorimetric sandwich ELISAs where used for detection of interferon 
alpha protein and are commercially available. 
5 Figure 24 depicts the results of mifepristone-dependent regulation of hlFN-alpha 

expression in C57BU6 mice. Plasmids for the hlNF-alpha/GENESWITCH® system (1:1 
mixtures of pIF1681 and pGS1694) were formulated with 6 mg/ml poly-I^glutamate in a 
total volume of 23 microliters and injected into the tibialis of mice followed by 
electroporation (375 V/cm. 2 pulses 25 ms each pulse). Total plasmid dose was 150 
10 micrograms. Mifepristone dosing (0.33 mg/kg in 100 microliters given orally by gavage) 
was initiated 6 days after plasmids were delivered. Mean levels of hINF-alpha protein (± 
sem) are shown. 

Prior to MFP dosing, hlFN-alpha levels were undetectable. lii response to the first 
week of MFP dosing, hlFN-alpha protein was induced to a mean level of -1.2 ng/nil. 

15 During the second and third weeks of MFP dosing, the induced level of hlFN was lower, 
reaching a mean value of -200 pg^ml. During the second and third months, mean induced 
levels of hlFN protein were 200-700 pg/ml. 

All patents and publications are hereby incorporated herein by reference as if they 
are fully set forth herein to the same extent as if each individual publication was 

20 specifically and individually indicated to be incorporated by reference. 

One skilled in the art will readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
those inherent therein. Transformed cells, vectors, compositions, molecular switches and 
receptors, along with the methods, procedures, treatments and molecules described herein 

25 are exemplary and representative of preferred embodiments. They are not intended as 
limitations on the scope of the invention. Hence, changes to and combinations of the 
examples describe herein are encompassed within the spirit of the invention as defined by 
the scope of the claims. 
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CLAIMS 

1. An inducible expression system comprising: 

a first expression cassette having a first nucleic acid sequence encoding for a 
fusion protein comprising 
5 a mutated DNA-binding domain, 

a transregulatory domain, and a mutated ligand-binding domain of a steroid- 
hormone receptor; and 

a second expression cassette having a second nucleic acid sequence encoding a 
desired product, wherein fusion protein is activated in the presence of an inducer and the 
10 expression of the second nucleic acid sequence is controlled by binding of the activated 
fusion protein on a promoter region of the second nucleic acid sequence. 

2. Hie inducible expression system of claim 1 wherein the mutated DNA binding 
domain is a mutated GAL-4 DNA binding domain that carries a mutation in a helical 

15 domain located in the region from amino acid 54 - 93 of SEQ ID NO: 10. 

3. The inducible expression system of claim 2 wherein the mutation is selected from a 
group consisting of a deletion, substitution, or addition. 

20 4. The inducible expression system of claim 3 wherein the mutation is a deletion of 
the amino acid 75-93 of SEQ ID NO: 10. 

5. The inducible expression system of claim 4 wherein the mutated GAL-4 DNA 
binding domain comprises amino add sequence 2-74 of SEQ ID NO: 10. 

25 

6. The inducible expression system of claim 3 wherein the mutation is a deletion of 
amino acids 54-74 of SEQ ID NO: 10. 

7. The inducible expression system of claim 3 wherein the mutation is a deletion of 
30 amino adds 54-64 of SEQ ID NO: 10. 

8. The inducible expression system of claim 3 wherein the mutation is a deletion of 
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9. The inducible expression system of claim 3 wherein the mutation is a deletion of 
amino acids 86-93 of SEQ ID NO: 10. 

5 

10. The inducible expression system of claim 1 wherein the first expression cassette 
further comprises a tissue-specific promoter. 

IL The inducible expression system of claim 10 wherein the tissue-specific promoter 
10 is a muscle-specific promoter. 

12. The inducible expression system of claim 11 wherein the muscle-specific promotea: 
is an a-actin promoter. 

15 13. The inducible expression system of claim 1 wherein the first expression cassette 
further comprises a 5* untranslated region, a synthetic intron, and a poly (A) signal 
sequence. 

14. The inducible expression system of claim 13 wherein the 5' untranslated region 
20 comprises SEQ ID NO: 8. 

15. The inducible expression system of claim 13 wherein the synthetic intron 
comprises a sequence selected firom a group consisting of SEQ ID NO: 5, SEQ ID NO: 6, 
and SEQ ID NO: 7. 

25 

16. The inducible expression system of claim 13 wherein the poly (A) signal is derived 
firom the poly (A) signal of human growth hormone. 

17. The inducible expression system of claim 1 wherein the transregulatory domain is 
30 selected from a group consistmg of NFkappaBp65, VP-16, TAF-1, TAF-2, TAU-1, TAU- 

2,TEF-landORF-10. 
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18. The inducible expression system of claim 1 wherein the steroid-hormone receptor 
is a progesterone receptor. 

19. The inducible expression system of claim 18 wherein the steroid-hormone receptor 
5 is a human progesterone receptor having a mutation within amino acids 813 - 933 of the 

human progesterone receptor. 

20. The inducible expression system of claim 19 wherein the mutation is a deletion of 
a portion of the ligand-binding domain of the human progesterone receptor (hPR), and 

10 wherein the portion is selected from a group consisting of amino acids 813-933 of the 
hPR, amino adds 873-933 of the hPR, amino acids 891-933 of the hPR, and amino acids 
914-933 of hPR. 

21. The inducible expression system of claim 1 wherein the fusion protein comprises 
15 an amino acid sequence selected from SEQ ID NO: 15 and SEQ ID NO: 24. 

22. The inducible expression system of claim 1 further comprising an inducer capable 
of activating the fusion protein. 

20 23. The inducible expression system of claim 22 wherein the inducer binds to the 
mutated ligand-binding domain and causes the fusion protein to form a homodimer. 

24. The inducible expression system of claim 22 wherein the fusion protein is inactive 
in the absence of the inducer. 

25 

25. The inducible expression system of claim 24 wherein the fusion protein is a 
monomer in the absence of tlie inducer. 

26. The inducible expression system of claim 24 wherein the fusion protein is 
30 incapable of binding to the promoter region in the absence of the inducer. 



27. The inducible expression system of claim 22 wherein the inducer is a non-natural 
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ligand. 

28. The inducible expression system of claim 27 wherein the inducer is selected from a 
group consisting of lip-(4-dimethylaminophenyl)-17P-hydroxy-17a-propynyl-4, 9- 

5 estradiene-3-one (RU38486 or mifepristone); lip-(4-dimethylaminophenyl)-17a- 
hydroxy-17p-(3-hydroxypropyl)-13 a methyl-4, 9-gonadiene-3-one (ZK98299 or 
Onapristone); lip-(4-acetylphenyl)-17p-hydroxy-17a-(l-propynylH,9-estradiene-3-one 
(ZKl 12993); 1 iP-(4-dimethylaniinQphenyl>17P-hydroxy-17o&-(3-hydroxy-l(Z)"propen 
estra-4,9-diene-3-one (ZK98734); (7p,liP,17P)-ll-(4-dimethylaminophenyl)-7-methyl- 
10 4*,5'-dihydrospiro [ester-4,9-diene-17, T (3'H)-furan]-3-one (Org31806); (lip,14p,17a)- 
4\5'-dihydro-ll-(4-dime1hylanMnophenyl)-[spiroestra-4,9-diene-17,2X3'H)-^^ 
(Org31376); 5-a-pregnane-3,2-dione, Org 33628, and Org 33245. 

29. The inducible expression system of claim 1 wherein the promoter region of the 
15 second nucleic acid sequence comprises a GAL-4 binding site. 

30. The inducible expression system of claim 28 wherein the GAL-4 binding site 
comprises SEQ ID NO: 9. 

20 31. The inducible expression system of claim 29 wherein the promoter region further 
comprises a sequence selected from a group consisting of SEQ ID NOS: 18, 19, 20, and 
21. 

32. The inducible expression system of claim 1 wherein the promoter region of the 
25 second nucleic acid sequence comprises three to six repeating GAL-4 binding sites. 

33. The inducible expression system of claim 1 wherein the fusion protein is greater 
than 80% humanized. 



30 



34. The inducible expression system of claim 1 wherein the second nucleic acid 
sequrace comprises a gene selected from a group consisting of: erythropoeitin, a clotting 
factor and an interferon. 
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35. The inducible expression system of claim 34 wherein the clotting factor is a Factor 
IX. 

5 36. The inducible expression system of claim 34 wherein the interferon is an interferon 
alpha. 

37. The inducible expression system of claim 1 wherein the second expression cassette 
manifests an uninduced level of basal expression that does not result in a biological effect 
10 from a pharmacological dose of the second expression cassette. 

. 38. The inducible expression system of claim 1 wherein the first and second 
expression cassette are part of a single nucleic acid vector. 

15 39. The inducible expression system of claim 1 wherein the first expression cassette is 
part of a first nucleic acid vector and the second expression cassette is part of a second 
nucleic acid vector. 

40. A method of regulating expression of a transgene in vivo comprising: 
20 administrating a first expression cassette encoding a regulator protein comprising: 

a mutated GAL-4 DNA-binding domain, wherein the mutation is 

found within amino acids 54-93 of SEQ ID NO: 10; 
a transcription transregulatory domain, and 
a mutated ligand-binding domain of a progesterone receptor, 
25 and a second regulated expression cassette encoding a desked transgene, wherein 

transcription from the second regulated expression cassette follows activation of the 
regulator protein by in vivo administration of an inducer capable of binding to the mutated 
ligand-binding domain the progesterone receptor. 

30 41. The method of claim 40 wherein the amino acid sequence of the regulator protein 
is selected fiiom SEQ ID NOS: 15 and 24. 
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42. Hie method of claim 40 wherein the inducer is administered after inflammation 
associated with administration of the first and second expression cassettes has subsided. 

43. An inducible erythropoietin expression system comprising: 
5 a first plasmid having the sequence SEQ ID NO: 22, and 

a second plasmid having the sequence SEQ ID NO: 23. 

44. A composition comprising: 

a pharmacological dose of a first plasmid having a first expression cassette and a 
10 second plasmid having a second expression cassette, wherein the first expression cassette 
comprises a nucleic acid sequence encoding for a fusion protein having the following 
domains: 

a mutated GAL-4 DNA-binding domain; 
a transregulatory domain; and 
IS a mutated ligiaiid-binding domain of a steroid-hormone receptor; 

and wherein 

the second expression cassette comprises a gene driven by an promoter that is 
regulated by the fusion protein. 

20 45. The composition of claim 44 further comprising: 

a polymer selected from a group consisting of non-ionic polymers and anionic 
polymers. 

46. The composition of claim 45 wherein the anionic polymer is poly-L-glutamate. 

25 

47. The composition of claim 45 wherein the non-ionic polymer is selected from a 
group consisting of PVP, PVA, and poloxamer. 

48. The composition of claim 45 wherein the polymer, the first plasmid, and the 
30 second plasmid are co-lyophilized. 

49. A method of inducing the expression of a transgene, the method comprising: 



wo 02/24899 



PCT/USOl/30305 



58 

(a) introducing an inducible expression system of claim 1 into a cell; 
. (b) inducing the expression of the transgene by administering an inducer. 

50 The method of claim 49 wherein the introduction of the inducible expression 
5 system is by injection and electroporation. 

51. The method of claim 50 wherein the introduction of the inducible expression 
system is by injection into muscle followed by electroporation. 

52. The method in claim 49 wherein the expression of (he transgene is induced at least 
10 12 days after the introduction of Hie inducible expression system into the cell. 

53. The method in claim 49 wherein the expression of the transgene is induced at least 
18 days after the introduction of the inducible expression system into the cell. 

15 54. The method m claim 49 wherein the expression of the transgene is induced at least 
20 days after the introduction of the inducible expression system into the cell. 

55. The method in claim 49 wherein the expression of the transgene is induced at least 
50 days after the introduction of the inducible expression system into the cell. 

20 

56. The method in claim 49 wherein the expression of the transgene is induced at least 
54 days after the introduction of the inducible expression system into the cell. 

57. The method in claim 49 wherein expression of the transgene is induced using a 
25 pulsatile program of induction. 

58. The method in claim 49 further comprising the step of formulating the first 
expression cassette and the second expression cassette with a polymer selected from the 
group consisting of anionic and non-ionic polymer. 



59. The method of claim 58 wherein the anionic polymer is poly-L-glutamate. 
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60 The method of claim 58 wherein the polymer is selected from a group consisting of 
PVP, PVA, and poloxamer. 

61. The method of 59 wherein the introduction of the inducible expression system is 
5 by intra-muscular injection and electroporation. 

62. The method of claim 61 wherein the second nucleic acid comprises a gene selected 
from a group consisting of: erythropoeitin, a clotting factor and an interferon. 

10 63. A inducible expression system comprising: 

a first expression cassette having a first nucleic acid sequence encoding for a 
fusion protein comprising 

a GAL-4 DNA binding domain mutated to have a deletion of at least five amino 
acids in the C-terminal portion, 
15 a transregulatory domain, and a mutated ligand-binding domain of a progesterone 

receptor having a deletion of at least 19 amino acids in the C-terminal portion; and 

a second expression cassette having a second nucleic acid sequence encoding a 
desired product, wherein fusion protein is activated in the presence of an inducer and the 
expression of the second nucleic acid sequence is controlled by binding of the activated 
20 fusion protein on a promoter region of the second nucleic acid sequence. 

64. The inducible system of claim 63 wherein the GAL-4 DNA binding domain is 
mutated to have a deletion of at least 10 amino acids. 

25 65. The inducible system of claim 63 wherein the GAL-4 DNA binding domain is 
mutated to have a deletion of at least 10 amino acids. 
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Figure 7 

GeneSwitch coding sequence 

pGS1210/1382 - GeneSwitch coding sequence version 3. 1 (SEQ.ID.NO: 12) 

p6S1539/1598 - GeneSwitch coding sequence version 4.0 (GAL4 domain underlined) (SEQ.ID. NO: 13) 



{1210cds} 


1 

ATGGACTCCC 


AGCAGCCAGA 


TCTGAAGCTA 


50 

CTGTCTTCTA TCGAACAAGC 


{1539cds} 


ATGGACTCCC 


AGCAGCCAGA 


TCTGAAGCTA 


CTGTCTTCTA TCGAACAAGC 


{1210cds} 


51 

ATGCGATATT 


TGCCGACTTA 


AAAAGCTCAA 


100 

GTGCTCCAAA GAAAAACCGA 


{1539cds} 


ATGCGATATT 


TGCCGACTTA AAAAGCTCAA 


GTGCTCCAAA GAAAAACCGA 


{1210cds} 


101 

AGTGCGCCAA 


GTGTCTGAAG 


AACAACTGGG 


150 

AGTGTCGCTA CTCTCCCAAA 


{1539cds} 


AGTGCGCCAA 


GTGTCTGAAG 


AACAACTGGG 


AGTGTCGCTA CTCTCCCAAA 


{,1210cds} 


151 

ACCAAAAGGT 


CTCCGCTGAC 


TAGGGCACAT 


200 

CTGACAGAAG TGGAATCAAG 


{1539cds} 


ACCAAAAGGT 


CTCCGCTGAC 


TAGGGCACAT 


CTGACAGAAG TGGAATCAAG 


{1210cds} 


201 

GCTAGAAAGA 


CTGGAACAGC 


TATTTCTACT 


250 

GATTTTTCCT CGAGAAGACC 


{1539cds} 


GCTAGAAAGA 


CTGGAACAGC 


TATTTCTACT 


GATTTTTCCT CGA 


{1210cds} 


251 

TTGACATGAT 


TTTGAAAATG 


GATTCTTTAC 


300 

AGGATATAAA AGCATTGTTA 


{1539cds} 











301 350 
{1210cds} GAATTCCCGG GTGTCGACCA GAAAAAGTTC AATAAAGTCA GAGTTGTGAG 
{1539cds} GACCA GAAAAAGTTC AATAAAGTCA GAGTTGTGAG 

351 400 
{1210cds} AGCACTGGAT GCTGTTGCTC TCCCACAGCC AGTGGGCGTT CCAAATGAAA 
{1539cds} AGCACTGGAT GCTGTTGCTC TCCCACAGCC AGTGGGCGTT CCAAATGAAA 

401 450 
{1210cds} GCCAAGCCCT AAGCCAGAGA TTCACTTTTT CACCAGGTCA AGACATACAG 
{1539cds} GCCAAGCCCT AAGCCAGAGA TTCACTTTTT CACCAGGTCA AGACATACAG 

451 500 
{1210cds} TTGATTCCAC CACTGATCAA CCTGTTAATG AGCATTGAAC CAGATGTGAT 
{1539cds} TTGATTCCAC CACTGATCAA CCTGTTAATG AGCATTGl^C CAGATGTGAT 

501 550 
{1210cds> CTATGCAG6A CATGACAACA CAAAACCTGA CACCTCCA6T TCTTTGCTGA 
{1539cds} CTATGCAGGA CATGACAACA CAAAACCTGA CACCTCCAGT TCTTTGCTGA 
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551 600 
{1210cds} CAAGTCTTAA TCAACTAGGC GAGAGGCAAC TTCTTTCAGT AGTCAAGTGG 
{1539cds} CAAGTCTTAA TCAACTAGGC GAGAGGCAAC TTCTTTCAGT AGTCAAGTGG 

601 650 
{1210cds} TCTAAATCAT TGCCAGGTTT TCGAAACTTA CATATTGATG ACCAGATAAC 
{1539cds} TCTAAATCAT TGCCAGGTTT TCGAAACTTA CATATTGATG ACCAGATAAC 

651 700 
{1210cds} TCTCATTCAG TATTCTTGGA TGAGCTTAAT GGTGTTTGGT CTAGGATGGA 
{1539cds} TCTCATTCAG TATTCTTGGA TGAGCTTAAT GGTGTTTGGT CTAGGATGGA 

701 750 
{1210cds} GATCCTACAA ACACGTCAGT GGGCAGATGC TGTATTTTGC ACCTGATCTA 
{1539cds} GATCCTACAA ACACGTCAGT GGGCAGATGC TGTATTTTGC ACCTGATCTA 

751 800 
{1210cds} ATACTAAATG AACAGCGGAT GAAAGAATCA TCATTCTATT CATTATGCCT 
{1539cds} ATACTAAATG AACAGCGGAT GAAAGAATCA TCATTCTATT CATTATGCCT 

801 850 

{1210cds} TACCATGTGG CAGATCCCAC AGGAGTTTGT CAAGCTTCAA GTTAGCCAAG 
{1539cds} TACCATGTGG CAGATCCCAC AGGAGTTTGT CAAGCTTCAA GTTAGCCAAG 

851 900 
{1210cds} AAGAGTTCCT CTGTATGAAA GTATTGTTAC TTCTTAATAC AATTCCTTTG 
{1539cds} AAGAGTTCCT CTGTATGAAA GTATTGTTAC TTCTTAATAC AATTCCTTTG 

901 950 
{1210cds} GAAGGGCTAC GAAGTCAAAC CCAGTTTGAG GAGATGAGGT CAAGCTACAT 
{1539cds} GAAGGGCTAC GAAGTCAAAC CCAGTTTGAG GAGATGAGGT CAAGCTACAT 

951 1000 
{1210cds} TAGAGAGCTC ATCAAGGCAA TTGGTTTGAG GCAAAAAGGA GTTGTGTCGA 
{1539cds} TAGAGAGCTC ATCAAGGCAA TTGGTTTGAG GCAAAAAGGA GTTGTGTCGA 

1001 1050 
{1210cds} GCTCACAGCG TTTCTATCAA CTTACAAA?^C TTCTTGATAA CTTGCATGAT 
{1539cds} GCTCACAGCG TTTCTATCAA CTTACAAAAC TTCTTGATAA CTTGCATGAT 

1051 1100 
{1210cds} CTTGTCAAAC AACTTCATCT GTACTGCTTG AATACATTTA TCCAGTCCCG 
{1539cds} CTTGTCAAAC AACTTCATCT GTACTGCTTG AATACATTTA TCCAGTCCCG 

1101 1150 
{1210cds} GGCACTGAGT GTTGAATTTC CAGAAATGAT GTCTGAAGTT ATTGCTGGGT 
{1539cds} GGCACTGAGT GTTGAATTTC CAGAAATGAT GTCTGAAGTT ATTGCTGGGT 

1151 1200 
{1210cds} CGACGCCCAT GGAATTCCAG TACCTGCCAG ATACAGACGA TCGTCACCGG 
{1539cds} CGACGCCCAT GGAATTCCAG TACCTGCCAG ATACAGACGA TCGTCACCGG 
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1201 1250 
{1210cds} ATTGAGGAGA AACGTAAAAG GACATATGAG ACCTTCAAGA GCATCATGAA 
{1539cds} ATTGAGGAGA AACGTAAAAG GACATATGAG ACCTTCAAGA GCATCATGAA 

1251 1300 
{1210cds} GAAGAGTCCT TTCAGCGGAC CCACCGACCC CCGGCCTCCA CCTCGACGCA 
{1539cds} GAAGAGTCCT TTCAGCGGAC CCACCGACCC CCGGCCTCCA CCTCGACGCA 

1301 1350 
{1210cds} TTGCTGTGCC TTCCCGCAGC TCAGCTTCTG TCCCCAAGCC AGCACCCCAG 
{1539cds} TTGCTGTGCC TTCCCGCAGC TCAGCTTCTG TCCCCAAGCC AGCACCCCAG 

1351 1400 
{1210cds} CCCTATCCCT TTACGTCATC CCTGAGCACC ATCAACTATG ATGAGTTTCC 
{1539cds} CCCTATCCCT TTACGTCATC CCTGAGCACC ATCAACTATG ATGAGTTTCC 

1401 1450 
{1210cds} CACCATGGTG TTTCCTTCTG GGCAGATCAG CCAGGCCTCG GCCTTGGCCC 
{1539cds} CACCATGGTG TTTCCTTCTG GGCAGATCAG CCAGGCCTCG GCCTTGGCCC 

1451 1500 
{1210cds} CGGCCCCTCC CCAAGTCCTG CCCCAGGCTC CAGCCCCTGC CCCTGCTCCA 
{1539cds} CGGCCCCTCC CCAAGTCCTG CCCCAGGCTC CAGCCCCTGC CCCTGCTCCA 

1501 1550 
{1210cds} GCCATGGTAT CAGCTCTGGC CCAGGCCCCA GCCCCTGTCC CAGTCCTAGC 
{1539cds} GCCATGGTAT CAGCTCTGGC CCAGGCCCCA GCCCCTGTCC CAGTCCTAGC 

1551 1600 
{1210cds} CCCAGGCCCT CCTCAGGCTG TGGCCCCACC TGCCCCCAAG CCCACCCAGG 
{1539cds} CCCAGGCCCT CCTCAGGCTG TGGCCCCACC TGCCCCCAAG CCCACCCAGG 

1601 1650 
{1210cds} CTGGGGAAGG AACGCTGTCA GAGGCCCTGC TGCAGCTGCA GTTTGATGAT 
{1539cds} CTGGGGAAGG AACGCTGTCA GAGGCCCTGC .TGCAGCTGCA GTTTGATGAT 

1651 1700 
{1210cds} GAAGACCTGG GGGCCTTGCT TGGCAACAGC ACAGACCCAG CTGTGTTCAC 
{1539cds} GAAGACCTGG GGGCCTTGCT TGGCAACAGC ACAGACCCAG CTGTGTTCAC 

1701 1750 
{1210cds} AGACCTGGCA TCCGTCGACA ACTCCGAGTT TCAGCAGCTG CTGAACCAGG 
{1539cds} AGACCTGGCA TCCGTCGACA ACTCCGAGTT TCAGCAGCTG CTGAACCAGG 

1751 1800 
{1210cds} GCATACCTGT GGCCCCCCAC ACAACTGAGC CCATGCTGAT GGAGTACCCT 
{1539cds} GCATACCTGT GGCCCCCCAC ACAACTGAGC CCATGCTGAT GGAGTACCCT 

1801 1850 

{1210cds} GAGGCTATAA CTCGCCTAGT GACAGGGGCC CAGAGGCCCC CCGACCCAGC 
{1539cds} GAGGCTATAA CTCGCCTAGT GACAGGGGCC CAGAGGCCCC CCGACCCAGC 
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1851 1900 
{1210cds} TCCTGCTCCA CTGGGGGCCC CGGGGCTCCC CAATGGCCTC CTTTCAGGAG 
{1539cds} TCCTGCTCCA CTGGGGGCCC CGGGGCTCCC CAATGGCCTC CTTTCAGGAG 

1901 1950 
{1210cds} ATGAAGACTT CTCCTCCATT GCGGACATGG ACTTCTCAGC CCTGCTGAGT 
{1539cds} ATGAAGACTT CTCCTCCATT GCGGACATGG ACTTCTCAGC CCTGCTGAGT 



1951 1965 
{1210cds} CAGATCAGCT CCTAA 
{1539cds} CAGATCAGCT CCTAA 
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Figure 8 

GeneSwitch amino acid sequence 

pGS1210/1382 - GeneSwitch amino acid sequence v 3.1 (SEQ.ID. NO: 14) 

pGS1539/1598 - GeneSwitch amino acid sequence v 4.0 (GAL4 domain underlined) (SEQ.ID. NO: 15) 

1 50 
{1210} jygPSQQPDL KL LSSIEQACDI CRLKKLKCSK EKPKCAKCLK ISINWECRYSPK 
{1539} MDSQQPD LKL LSSIEQACDI CRLKKLKCSK EKPKCAKCLK NNWECRYSPK 

51 100 
{1210} TKRSPLTRAH LTEVESRLER LEQLFLLIFP REDLDMILKM DSLQDIKALL 
{1539} TKRSPLTRAH LTEVESRLER LEQLFLLIFP R 

101 150 
{1210} EFPGVDQKKF NKVRWRALD AVALPQPVGV PNESQALSQR FTFSPGQDIQ 
{1539} DQKKF NKVRWRALD AVALPQPVGV PNESQALSQR FTFSPGQDIQ 

151 200 
{1210} LIPPLINLLM SIEPDVIYAG HDNTKPDTSS SLLTSLNQLG ERQLLSWKW 
{1539} LIPPLINLLM SIEPDVIYAG HDNTKPDTSS SLLTSLNQLG ERQLLSWKW 

201 250 

{1210} SKSLPGFRNL HIDDQITLIQ YSWMSLMVFG LGWRSYKHVS GQMLYFAPDL 
{1539} SKSLPGFRNL HIDDQITLIQ YSVflMSLMVFG LGWRSYKHVS GQMLYFAPDL 

251 300 
{1210} ILNEQRMKES SFYSLCLTMW QIPQEFVKLQ VSQEEFLCMK VLLLLNTIPL 
{1539} ILNEQRMKES SFYSLCLTMW QIPQEFVKLQ VSQEEFLCMK VLLLLNTIPL 

301 350 
{1210} EGLRSQTQFE EMRSSYIREL IKAIGLRQKG WSSSQRFYQ LTKLLDNLHD 
{1539} EGLRSQTQFE EMRSSYIREL IKAIGLRQKG WSSSQRFYQ LTKLLDNLHD 

351 400 
{1210} LVKQLHLYCL NTFIQSRALS VEFPEMMSEV lAGSTPMEFQ YLPDTDDRHR 
{1539} LVKQLHLYCL NTFIQSRALS VEFPEMMSEV lAGSTPMEFQ YLPDTDDRHR 

401 450 
{1210} lEEKRKRTYE TFKSIMKKSP FSGPTDPRPP PRRIAVPSRS SASVPKPAPQ 
{1539} lEEKRKRTYE TFKSIMKKSP FSGPTDPRPP PRRIAVPSRS SASVPKPAPQ 

451 500 
{1210} PYPFTSSLST INYDEFPTMV FPSGQISQAS ALAPAPPQVL PQAPAPAPAP 
{1539} PYPFTSSLST INYDEFPTMV FPSGQISQAS ALAPAPPQVL PQAPAPAPAP 

501 550 
{1210} AMVSALAQAP AFVPVLAPGP PQAVAPPAPK PTQAGEGTLS EALLQLQFDD 
{1539} AMVSALAQAP APVPVLAPGP PQAVAPPAPK PTQAGEGTLS EALLQLQFDD 
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551 600 
{1210} EDLGALLGNS TDPAVFTDLA SVDNSEFQQL LNQGIPVAPH TTEPMLMEYP 
{1539} EDLGALLGNS TDPAVFTDLA SVDNSEFQQL LNQGIPVAPH TTEPMLMEYP 

601 650 

{1210} EAITRLVTGA QRPPDPAPAP LGAPGLPNGL LSGDEDFSSI ADMDFSALLS 
{1539} EAITRLVTGA QRPPDPAPAP LGAPGLPNGL LSGDEDFSSI ADMDFSALLS 

651 655 
{1210} QISS* 
{1539} QISS* 
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Figure 9A 
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Figure 9B 



Inducible 6X GAli4/TATA Promoter 



AAG|CGGAGTACTGTCCTCCG|AG|TGGAGTACTGTCCTCCG|^GiCGGAGTACTGTCCTCCG|AGTCG 



AGGGTCGAA( ^GGAGTACTGTCCTCC($ ^G frGGAGTACTGTCCTCCG[ AG iCGGAGTACTGTCCTCCG 
AGTCGACTCTAGAGGG^^^^TGGATCTCGAGATATCGGAGCT i CGTTTAGTGAACCGTC. . 

(SEQ.ID.NO:25) 



Figure 9C 



Sail TATA box EcoRV iia \ ^ \ BsmBI 

GTCGACTCTAGAGGGTATATiATGGATCTCGAGA TATCGGAGCTCCTTTAGTGAACCGTCAGATO gCCTKSGAG^ 
CAGCTGAGATCTCCCATATATTACCTAGAGCTCT ATAGCCTTOASCAAATCAgTOGGraCT^ 

t t 

Sac n TFBS 

ATCCACGCTGTTTTGACCTCCATAGAAGCACCGGGACCGATCCAGCCTCCGCGGCCGGGAACGGTGCATTGGAACGCG 
TAGGTGCGAa^AAACTG<3tf«WTATCTTCGTGGCCCax;GCTAGGTCGGAGGC<^ 

Pad 

GATTCCCCGTGTTAATTAA SEO IDJ^O" 17 

CTAAGGGGCACAATTAATT 
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Figure 10 



TATA &oj(yinitiator region (inr)/ l]T12 



Inducible Promoter Region 
pEP1556 -intact inducible promoter region: 

transcription factor binding site (UT12-TFBS) 
pEP1595 - SalUEcoRY deletion of TATA box pro moter region : im:/UT12-TFBS 
pEP1596 - EcoRV/BsmBl deletion of inr region: 
pEP1597 - SacTUPacI deletion of UT12 region: 



TATAboxfUTlT^TFBS 



TATA box fim 



1 50 
{pepl556} CCGAGTCGAC TCTAGAGGGT ATATAATGGA TCTCGAGATA TCGGAGCTCG 

{pepl595} CCGAGTCGA TCGGAGCTCG 

{pepl596} CCGAGTCGAC TCTAGAGGGT ATATAATGGA TCTCGAGAT 

{pepl597} CCGAGTCGAC TCTAGAGGGT ATATAATGGA TCTCGAGATA TCGGAGCTCG 

51 100 
{pepl556} TTTAGTGAAC CGTCAGATC G CCTGGAGACG CCATCCACGC TGTTTTGACC 
{pepl595} TTTAGTGAAC CGTCAGATC G CCTGGAGACG CCATCCACGC TGTTTTGACC 

{pepl596} CG CCTGGAGACG CCATCCACGC TGTTTTGACC 

{pepl597} TTTAGTGAAC CGTCAGAT CG CCTGGAGACG CCATCCACGC TGTTTTGACC 

101 150 
{pepl556} TCCATAGAAG ACACCGGGAC CGATCCAGCC TCC6CGGCC6 6GAAC6GTGC 
{pepl595} TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 66AACGGTGC 
{pepl596} TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 6GAACGGTGC 
{pepl597} TCCATAGAAG ACACCGGGAC CGATCCAGCC TCC 



151 184 
{pepl556} ATTGGAACGC GGATTCCCCG TGTTAATTAA CAGG 
{pepl595} ATTGGAACGC GGATTCCCCG TGTTAATTAA CAGG 
{pepl596} ATTGGAACGC GGATTCCCCG TGTTAATTAA CAGG 
{pepl597} TAA CAGG 



(SEQ. ID. NO: 18) 
(SEQ. ID. NO: 19) 
(SEQ. ID. NO: 20) 
(SEQ. ID. NO: 21) 



11/22 



wo 02/24899 



PCT/USOl/30305 



Figure 11 



Feature Map 
SK Promoter 

Start: 2 End: 440 
UT12 
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GeneSwitch CDS 
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Figure 12 



Feature Map 
6x GAL4 sites 

Start: 8 End: 137 
El b TATA 

Start: 147 End: 160 
UT12 

Start: 171 End: 274 
IVS8 

Start: 285 End: 402 
hEPO 

Start: 410 End: 1001 
hGHpA 

Start: 1028 End: 1217 
pUCori 

Start: 1286 End: 2072 
KanR 

Start: 2085 End: 2998 
KanR CDS 

Start: 2085 End: 2879 




pUCori 
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Figure 13 
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950-1774 
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1784- 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
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1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 



ggggccgctc 
cctggccgca 
aggagccccg 
ccggctcccg 
ggcccggcgg 
gcagacgctc 
agcgccttct 
caaatatggc 
cgcccgcctc 
ggagacgcca 
tccagcctcc 
taattaacag 
tcaaccttcc 
actaacggtt 
atggactccc 
atgcffatatt 
agtgcgccaa 
accaaaaggt 
gctagaaaga 
aaaagttcaa 



tagctagagt 
cgctccctca 
ctgccccccg 
cacggccctg 
tgccggggga 
cttatacggc 
ttgggcagcg 
gacggccggg 
gataaaaggc 
tccacgctgt 
gcggccggga 
gtaagtgtct 
tatcagaaac 
ctttttttct 
agcagccaga 
tgccgactta 
gtgtctgaag 
ctGcgctgac 
ctggaaaagc 
taaagtcaga 



ctgcctgccc 
caggtgaagc 
aggcctcctc 
ggagaggccc 
ttatttcggc 
ccggcctcgc 
ccgggccggg 
gccgcattcc 
tccggggccg 
tttgacctcc 
acggtgcatt 
tcctcctgtt 
tgcagtatct 
cttcacaggc 
tctgaagcta 
aaaa^ctcaa 
aacaactggg 
tagggcacat 
tatttctace 
gttgtgagag 



cctgcctggc 
tcgaaaactc 
cctcacgcct 
ccaccgcttc 
cccggccccg 
tcacctgggc 
gccgcgccgg 
tgggggccgg 
gcgggcgact 
atagaagaca 
ggaacgcgga 
tccttcccct 
gtatttttgc 
caccaagcta 
ctgtcttcta 
gtgctccaaa 
agtgtcgcta 
Gtgacagaag 
gatttttcct 
cactggabgc 



acagcccgta 
cgtccccgta 
cgctgcgctc 
gtccttaacg 
ggggggcccg 
cgcggccagg 
gcccgacacc 
gcggtgctcc 
cagatcgcct 
ccgggaccga 
ttccccgtgt 
gctattctgc 
tagcagtaat 
ccggtccacc 
tcgaacaagc 
gaaaaaccga 
ctctcccaaa 
tggaatcaag 
cgagaccaga 
tgttgcbctc 



ccacagccag tgggcgttcc aaatgaaagc caagccctaa gccagagatt 
cactttttca ccaggtcaag acatacagtt gattccacca ctgatcaacc 
tgttaatgag cattgaacca gatgtgatct atgcaggaca tgacaacaca 
aaacctgaca cctccagttc tttgctgaca agtcttaatc aactaggcga 
gaggcaactt ctttcagtag tcaagtggtc taaatcattg ccaggttttc 
gaaacttaca tattgatgac cagataactc tcattcagta ttcbtggatg 
agcttaatgg tgtttggtct aggatggaga tcctacaaac acgtcagtgg 
gcagatgctg tattttgcac ctgatctaat actaaatgaa cagcggatga 
aagaatcatc attctattca ttatgcctta ccatgtggca gatcccacag 
gagtttgtca agcttcaagt tagccaagaa gagttcctct gtatgaaagt 
attgttactt cttaatacaa ttcctttgga agggctacga agtcaaaccc 
agtttgagga gatgaggtca agctacatta gagagctcat caaggcaatt 
ggtttgaggc aaaaaggagt tgtgtcgagc tcacagcgtt tctatcaact 
tacaaaactt cttgataact tgcatgatct tgtcaaacaa cttcatctgt 
actgcttgaa tacatttatc cagtcccggg cactgagtgt tgaatttcca 



gaaatgatgt ctgaagttat tgctg ggtcg 



cctgccagat 
catatgagac 
accgaccccc 
agcttctgtc 
tgagcaccat 
cagatcagcc 
ccaggctcca 
aggccccagc 
gccccacctg 
ggccctgctg 
gcaacagcac 
tccgagtttc 
aactgagccc 
caggggccca 
gggctcccca 
ggacatggac 
ccggactaga 



acagacgatc 
cttcaagagc 
ggcctccacc 
cccaagccag 
caactatgat: 
aggcctcggc 
gcccctgccc 
ccetgtccca 
cccccaagcc 
cagctgcagt 
agacccagct 
agcagctgct 
atgctgatgg 
gaggcccccc 
atggcctcct 
ttctcagccc 
aaagccgaat 



gtcaccggat 
atcatgaaga 
tcgacgcatt 
caccccagcc 
gagtttccca 
cttggccccg 
ctgctccagc 
gtectagccc 
cacccaggct 
ttgatgatga 
gtgttcacag 
gaaccagggc 
agtaccctga 
gacccagctc 
ttcaggagat 
tgctgagtca 
tctgcaggaa 



acgcccatgg 
tgaggagaaa 
agagtccttt 
gctgtgcctt 
ctatcccttt 
ccatggtgtt 
gcccctcccc 
catggtatca 
caggccctcc 
ggggaaggaa 
agacctgggg 
acctggcatc 
atacctgtgg 
ggctataact 
ctgctccact 
gaagacttct 
gat cage tec 
ttgggtggca 



aattccagta 
cgtaaaagga 
cagcggaccc 
cccgcagctc 
acgtcatccc 
tccttctggg 
aagtcctgcc 
gctctggccc 
tcaggctgtg 
cgctgtcaga 
gccttgcttg 
cgtcgacaac 
ccccccacac 
cgcctagtga 
gggggccccg 
cctecattgc 
taaggatcct 
tccctgtgac 



13/22 



wo 02/24899 



PCT/USOl/30305 



2651 ccctccccag tgcctctcct 

2701 cagccttgtc ctaataaaat 

2751 ccttctataa tattatgggg 

2801 gttgggaaga caacctgtag 

2851 ggtctcccta tagtgagtcg 

2901 attaatgaat cggccaacgc 

2951 tcttccgctt cctcgctcac 

3001 gcgagcggta tcagctcact 

3051 caggggataa cgcaggaaag 

3101 aggaaccgta aaaaggccgc 

3151 ccctgacgag catcacaaaa 

3201 cgacaggact ataaagatac 

3251 cgctctcctg ttccgaccct 

3301 cccttcggga agcgtggcgc 

3351 gttcggtgta ggtcgttcgc 

3401 gttcagcccg accgctgcgc 

3451 cccggtaaga cacgacttat 

3501 ttagcagagc gaggtatgta 

3551 cctaactacg gctacactag 

3601 gaagccagtt accttcggaa 

3651 aaaccaccgc tggtagcggt 

3701 cgcagaaaaa aaggatctca 

3751 tgacgctcag tggaacgaaa 

3801 cgcctaggct tttgcaaaga 

3851 gcatgattga acaagatgga 

3901 gagaggctat tcggctatga 

3951 tgccgccgtg ttccggctgt 

4001 agaccgacct gtccggtgcc 

4051 ctatcgtggc tggccacgac 

4101 tgtcactgaa gcgggaaggg 

4151 aggatctcct gtcatctcac 

4201 gctgatgcaa tgcggcggct 

4251 cgaccaccaa gcgaaacatc 

4301 ccggtcttgt cgatcaggat 

4351 ccagccgaac tgttcgccag 

4401 tctcgtcgtg acccatggcg 

4451 atggccgctt ttctggattc 

4501 cgctatcagg acatagcgtt 

4551 cggcgaatgg gctgaccgct 

4601 attcgcagcg catcgccttc 

4651 ggactctggg gttcgaaatg 

4701 cgagatttcg attccaccgc 

4751 cgttttccgg gacgccggct 

4801 tggagttctt cgcccaccct 

4851 atgtatttag aaaaataaac 

4901 aagtgccacc taaattgtaa 

4951 atttttgtta aatcagctca 

5001 atcccttata aatcaaaaga 

5051 agtttggaac aagagtccac 

5101 ggcgaaaaac cgtctatcag 

5151 taatcaagtt ttttggggtc 

5201 taaagggagc ccccgattta 

5251 cgagaaagga agggaagaaa 

5301 agtgtagcgg tcacgctgcg 

5351 gccgctacag ggcgcgtccc 

5401 aagggcgatc ggtgcgggcc 



ggccctggaa gttgccactc cagtgcccac 
taagttgcat cattttgtct gactaggtgt 
tggagggggg tggtatggag caaggggcaa 
ggctcgaggg ggggcccggt acgatctgcc 
tattaatttc gataagccag gttaacctgc 
gcggggagag gcggtttgcg tattgggcgc 
tgactcgctg cgctcggtcg ttcggctgcg 
caaaggcggt aatacggtta tccacagaat 
aacatgtgag caaaaggcca gcaaaaggcc 
gttgctggcg tttttccata ggctccgccc 
atcgacgctc aagtcagagg tggcgaaacc 
caggcgtttc cccctggaag ctccctcgtg 
gccgcttacc ggatacctgt ccgcctttct 
tttctcatag ctcacgctgt aggtatctca 
tccaagctgg gctgtgtgca cgaacccccc 
cttatccggt aactatcgtc ttgagtccaa 
cgccactggc agcagccact ggtaacagga 
ggcggtgcta cagagttctt gaagtggtgg 
aagaacagta tttggtatct gcgctctgct 
aaagagttgg tagctcttga tccggcaaac 
ggtttttttg tttgcaagca gcagattacg 
agaagatcct ttgatctttt ctacggggtc 
actcacgtta agggattttg gtcatgagcg 
tcgatcaaga gacaggatga ggatcgtttc 
ttgcacgcag gttctccggc cgcttgggtg 
ctgggcacaa cagacaatcg gctgctctga 
cagcgcaggg gcgcccggtt ctttttgtca 
ctgaatgaac tgcaagacga ggcagcgcgg 
gggcgttcct tgcgcagctg tgctcgacgt 
actggctgct attgggcgaa gtgccggggc 
cttgctcctg ccgagaaagt atccatcatg 
gcatacgctt gatccggcta cctgcccatt 
gcatcgagcg agcacgtact cggatggaag 
gatctggacg aagagcatca ggggctcgcg 
gctcaaggcg agcatgcccg acggcgagga 
atgcctgctt gccgaatatc atggtggaaa 
atcgactgtg gccggctggg tgtggcggac 
ggctacccgt gatattgctg aagagcttgg 
tcctcgtgct ttacggtatc gccgctcccg 
tatcgccttc ttgacgagtt cttctgagcg 
accgaccaag cgacgcccaa cctgccatca 
cgccttctat gaaaggttgg gcttcggaat 
ggatgatcct ccagcgcggg gatctcatgc 
aggcgcgctc atgagcggat acatatttga 
aaataggggt tccgcgcaca tttccccgaa 
gcgttaatat tttgttaaaa ttcgcgttaa 
ttttttaacc aataggccga aatcggcaaa 
atagaccgag atagggttga gtgttgttcc 
tattaaagaa cgtggactcc aacgtcaaag 
ggcgatggcc cactacgtga accatcaccc 
gaggtgccgt aaagcactaa atcggaaccc 
gagcttgacg gggaaagccg gcgaacgtgg 
gcgaaaggag cgggcgctag ggcgctggca 
cgtaaccacc acacccgccg cgcttaatgc 
attcgccatt caggctgcgc aactgttggg 
tcttcgctat tacgccagct ggcgaaaggg 
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5451 ggatgtgctg caaggcgatt aagttgggta acgccagggt tttcccagtc 

5501 acgacgttgt aaaacgacgg ccagtgagcg cgcgtaatac gactcactat 

5551 agggcgaatt gggtac (SEQ. ID. NO: 22) 
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Figure 14 

1 gggtcgaagc ggagtactgt cctccgagtg gagtactgtc ctccgagcgg 
51 agtactgtcc tccgagtcga gggtcgaagc ggagtactgt cctccgagtg 
101 gagtactgtc ctccgagcgg agtactgtcc tccgagtcga ctctagaggg 
151 tatataatgg atctcgagat gcctggagac gccatccacg ctgttttgac 
201 ctccatagaa gacaccggga ccgatccagc ctccgcggcc gggaacggtg 
251 cattggaacg cggattcccc gtgttaatta acaggtaagt gtcttcctcc 
301 tgtttccttc ccctgctatt ctgctcaacc ttcctatcag aaactgcagt 
351 atctgtattt ttgctagcag taatactaac ggttcttttt ttctcttcac 
401 aggccaccaa gcttccatflrflr gggtgcacga atgtcctgcc tggctgtggc 
451 tgctcctgtc cctgctgtcc ctccctctgg gcctcccagt cctgggcgcc 
501 ccaccacgcc tcatctgtga cagccgcgtc ctggagaggt atctcctgga 
miman 551 ggccaaggag gccgagaata tcacgacggg ctgtgctgaa cactgcagcc 
EPo tgaatgagaa tatcactgtc ccagacacca aagtgaattt ctatgcctgg 

Coding aagaggatgg aggtcgggca gcaggccgtg gaagtctggc agggcctggc 

Sequence cctgctgtcc gaagctgtcc tgcggggcca ggccctgctg gtcaactctt 

417-998 '^^^ cccagccgtg ggagcccctg cagctgcatg tggataaagc cgtcagtggc 
801 ctgcgcagcc tcaccactct gctgcgggct ctgggagccc agaaggaagc 
851 catctcccct ccagatgcgg cctccgctgc tccactccgc acaatcactg 
901 ctgacacttt ccgcaaactc ttccgagtct actccaattt cctccgggga 
951 aagctgaagc tgtacacagg ggaggcctgc aggacagggg acagatgagt 
1001 ctagaaaagc cgaattctgc aggaattggg tggcatccct gtgacccctc 
1051 cccagtgcct ctcctggccc tggaagttgc cactccagtg cccaccagcc 
1101 ttgtcctaat aaaattaagt tgcatcattt tgtctgacta ggtgtccttc 
1151 tataatatta tggggtggag gggggtggta tggagcaagg ggcaagttgg 
1201 gaagacaacc tgtagggctc gagggggggc ccggtaccag cttttgttcc 
1251 ctttagtgag ggttaatttc gagcttggtc ttccgcttcc tcgctcactg 
1301 actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 
1351 aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa 
1401 catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt. 
1451 tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 
1501 cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 
1551 ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc 
1601 cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 
1651 tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 
1701 caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 
1751 tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 
1801 ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 
1851 cggtgctaca gagttcttga agtggtggcc taacfcacggc tacactagaa 
1901 ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 
1951 agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 
2001 tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 
2051 aagatccttt gatcttttct acggggtctg acgctcagaa gaactcgtca 
2101 agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta 
2151 aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 
2201 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 
2251 ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg 
2301 caagcaggca tcgccatgcg tcacgacgag atcctcgccg tcgggcatgc 
2351 gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 
2401 tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc 
2451 tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 
2501 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 
2551 gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa 
2601 tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc 
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2651 aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 
2701 agttcattca' gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg 
2751 cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 
2801 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 
2851 cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc ctcatcctgt 
2901 ctcttgatca gatcttgatc ccctgcgcca tcagatcctt ggcggcaaga 
2951 aagccatcca gtttactttg cagggcttcc caaccttacc agagggcgcc 
3001 ccagctggca attccggttc gcttgctgtc cataaaaccg cccagtctag 
3051 caactgttgg gaagggcggg gctgcaggaa ttcgagcttg catgcctgca 
(SEQ.ID.NO:23) 
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Figure 17 
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Figure 18 
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Figure 19 



Figure 20 



Figure 21 
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Figure 22 
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SEQUENCE LISTING 



<110> VALENTIS, INC. 

<120> IMPROVED SYSTEM FOR REGULATION OF TRANSGENE EXPRESSION 

<130> 263/083 PCT 

<140> Not Yet Assigned 
<141> 2001-09-25 

<150> US 60/278,281 
<151> 2001-03-23 

<160> 25 

<170> Windows Notepad 

<210> 1 
<211> 9 . 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 5' splice site' 
sequence of a synthetic intron 



<210> 2 
<211> 9 
<212> RNA 

<213> Homo sapiens 
<400> 2 

guccauuca 9 

<210> 3 
<211> 7 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: branch point 
sequence for a synthetic intron 



<400> 1 
cagguaagu 



9 



<400> 3 



uacuaac 



7 
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<210> 4 
<211> .6 

<212> RJSIA 

<213> Homo sapiens 

<400> 4 

augaug 6 

<210> 5 
<211> 86 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: A general 
synthetic intron seqeuence where intronic 
sequences are represented by and N can be any 
base and can be longer or shorter than depicted 

<220> 

<221> intron 
<222> (10) . . (57) 

<400> 5 

caggtaagtn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nrmnnnnnnn nnnnnntact 60 
aacnnttctt tttttctctt cacagg 86 

<210> 6 
<211i^ 85 
<212> DNA" 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Consensus 
intron sequence wherein intronic sequencs are 
represented by N, and N can be any base and can be 
longer or shorter than depicted 

<220> 

<221> intron 
<222> (10).. (58) 



<210> 7 
<211> 160 
<212> DNA 

<213> Artificial Sequence 
<220> 



<220> 



<220> 



<400> 6 

maggtragtn nnnnnnnnnn nnnnn] 
trayyyyyyy yyyyyyyyyy nyagg 



nnnnnnnnnn nnnnnnnnnn nnnnnnnyny 60 

85 
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<223> Description of Artificial Sequence: A synthetic 
intron termed IVS8 where exonic sequences are 
represented by N, N can be any base and can be 
longer or shorter 

<400> 7 

nnnnnnnnnn ttaattaaca ggtaagtgtc ttcctcctgt ttccttcccc ctgctattct 60 
gctcaacctt cctatcagaa actgcagtat ctgtattttt gctagcagtt atactaacgg 120 
ttcttttttt ctcttcacag gccaccatgg nnnnnnnnnn 160 



<210> 8 
<211> 110 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 5' Untranslated 
region derived from CMV 

<400> 8 

tcagatcgcc tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg 60 
atccagcctc cgcggccggg aacggtgcat tggaacgcgg attccccgtg 110 



<210> 9 
<211> 17 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 9 

cggaagactc tcctccg 17 



<210> 10 
<211> 93 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 10 

Met Lys Leu Leu Ser Ser lie Glu Gin Ala Cys Asp lie Cys Arg Leu 
1.5 10 15 

Lys Lys Leu Lys Cys Ser Lys Glu Lys Pro Lys Cys Ala Lys Cys Leu 
20 25 30 

Lys Asn Asn Trp Glu Cys Arg Tyr Ser Pro Lys Thr Lys Arg Ser Pro 
35 40 45 

Leu Thr Arg Ala His Leu Thr Glu Val Glu Ser Arg Leu Glu Arg Leu 
50 55 60 

Glu Gin Leu Phe Leu Leu He Phe Pro Arg Glu Asp Leu Asp Met He 
65 70 75 80 
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Leu Lys Met Asp Ser Leu Gin Asp lie Lys Ala Leu Leu 
85 90 



<210> 11 
<211> 28 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: General • 
sequence for GAL4 DNA recognition unit having 
cysteine-rich amino acid sequence forming two 
alpha helices that bind to two Zn ions 

<220> 

<221> HELIX 
<222> (1) . . (28) 

<400> 11 

Cys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Cys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys 
20 25 



<210> 12 
<211> 1965 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Coding Sequence 
for the GeneSwitch regulator protein v. 3.1 
comprising a GAL-4 DBD, a NFkBp65 tr ansae ti vat ion 
domain and a mutated hPR LBD 

<400> 12 

atggactccc agcagccaga tctgaagcta ctgtcttcta tcgaacaagc atgcgatatt 60 
tgccgactta aaaagctcaa gtgctccaaa gaaaaaccga agtgcgccaa gtgtctgaag 120 
aacaactggg agtgtcgcta ctctcccaaa accaaaaggt ctccgctgac tagggcacat 180 
ctgacagaag tggaatcaag gctagaaaga ctggaacagc tatttctact gatttttcct 240 
cgagaagacc ttgacatgat tttgaaaatg gattctttac aggatataaa agcattgtta 300 
gaattcccgg gtgtcgacca gaaaaagttc aataaagtca gagttgtgag agcactggat 360 
gctgttgctc tcccacagcc agtgggcgtt ccaaatgaaa gccaagccct aagccagaga 420 
ttcacttttt caccaggtca agacatacag ttgattccac cactgatcaa cctgttaatg 480 
agcattgaac cagatgtgat ctatgcagga catgacaaca caaaacctga cacctccagt 540 
tctttgctga caagtcttaa tcaactaggc gagaggcaac ttctttcagt agtcaagtgg 600 
tctaaatcat tgccaggttt tcgaaactta catattgatg accagataac tctcattcag 660 
tattcttgga tgagcttaat ggtgtttggt ctaggatgga gatcctacaa acacgtcagt 720 
gggcagatgc tgtattttgc acctgatcta atactaaatg aacagcggat gaaagaatca 780 
tcattctatt cattatgcct taccatgtgg cagatcccac aggagtttgt caagcttcaa 840 
gttagccaag aagagttcct ctgtatgaaa gtattgttac ttcttaatac aattcctttg 900 
gaagggctac gaagtcaaac ccagtttgag gagatgaggt caagctacat tagagagctc 960 
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atcaaggcaa ttggtttgag gcaaaaagga gttgtgtcga gctcacagcg tttctatcaa 1020 
cttacaaaac ttcttgataa cttgcatgat cttgtcaaac aacttcatct gtactgcttg 1080 
aatacattta tccagtcccg ggcactgagt gttgaatttc cagaaatgat gtctgaagtt 1140 
attgctgggt cgacgcccat ggaattccag tacctgccag atacagacga tcgtcaccgg 1200 
attgaggaga aacgtaaaag gacatatgag accttcaaga gcatcatgaa gaagagtcct 1260 
ttcagcggac ccaccgaccc ccggcctcca cctcgacgca ttgctgtgcc ttcccgcagc 1320 
tcagcttctg tccccaagcc agcaccccag ccctatccct ttacgtcatc cctgagcacc 1380 
atcaactatg atgagtttcc caccatggtg tttccttctg ggcagatcag ccaggcctcg 1440 
gccttggccc cggcccctcc ccaagtcctg ccccaggctc cagcccctgc ccctgctcca 1500 
gccatggtat cagctctggc ccaggcccca gcccctgtcc cagtcctagc cccaggccct 1560 
cctcaggctg tggccccacc tgcccccaag cccacccagg ctggggaagg aacgctgtca 1620 
gaggccctgc tgcagctgca gtttgatgat gaagacctgg gggccttgct tggcaacagc 1680 
acagacccag ctgtgttcac agacctggca tccgtcgaca actccgagtt tcagcagctg 1740 
ctgaaccagg gcatacctgt ggccccccac acaactgagc ccatgctgat ggagtaccct 1800 
gaggctataa ctcgcctagt gacaggggcc cagaggcccc ccgacccagc tcctgctcca 1860 
ctgggggccc cggggctccc caatggcctc ctttcaggag atgaagactt ctcctccatt 1920 
gcggacatgg acttctcagc cctgctgagt cagatcagct cctaa 1965 

<210> 13 
<211> 1893 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Coding Seq[uence ' 
for GeneSwitch Regulator protein v. 4.0 comprising 
a trxmcated GAL4 DBD, p65 transactivation domain 
and mutated hPR LBD 

<400> 13 

atggactccc agcagccaga tctgaagcta 
tgccgactta aaaagctcaa gtgctccaaa 
aacaactggg agtgtcgcta ctctcccaaa 
ctgacagaag tggaatcaag gctagaaaga 
cgagaccaga aaaagttcaa taaagtcaga 
ccacagccag tgggcgttcc aaatgaaagc 
ccaggtcaag acatacagtt gattccacca 
gatgtgatct atgcaggaca tgacaacaca 
agtcttaatc aactaggcga gaggcaactt 
ccaggttttc gaaacttaca tattgatgac 
agcttaatgg tgtttggtct aggatggaga 
tattttgcac ctgatctaat actaaatgaa 
ttatgcctta ccatgtggca gatcccacag 
gagttcctct gtatgaaagt attgttactt 
agtcaaaccc agtttgagga gatgaggtca 
ggtttgaggc aaaaaggagt tgtgtcgagc 
cttgataact tgcatgatct tgtcaaacaa 
cagtcccggg cactgagtgt tgaatttcca 
acgcccatgg aattccagta cctgccagat 
cgtaaaagga catatgagac cttcaagagc 
accgaccccc ggcctccacc tcgacgcatt 
cccaagccag caccccagcc ctatcccttt 
gagtttccca ccatggtgtt tccttctggg 
gcccctcccc aagtcctgcc ccaggctcca 



ctgtcttcta tcgaacaagc atgcgatatt 60 
gaaaaaccga agtgcgccaa gtgtctgaag 120 
accaaaaggt ctccgctgac tagggcacat 180 
ctggaacagc tatttctact gatttttcct 240 
gttgtgagag cactggatgc tgttgctctc 300 
caagccctaa gccagagatt cactttttca 360 
ctgatcaacc tgttaatgag cattgaacca 420 
aaacctgaca cctccagttc tttgctgaca 480 
ctttcagtag tcaagtggtc taaatcattg 540 
cagataactc tcattcagta ttcttggatg 600 
tcctacaaac acgtcagtgg gcagatgctg 660 
cagcggatga aagaatcatc attctattca 720 
gagtttgtca agcttcaagt tagccaagaa 780 
cttaatacaa ttcctttgga agggctacga 840 
agctacatta gagagctcat caaggcaatt 900 
tcacagcgtt tctatcaact tacaaaactt 960 
cttcatctgt actgcttgaa tacatttatc 1020 
gaaatgatgt ctgaagttat tgctgggtcg 1080 
acagacgatc gtcaccggat tgaggagaaa 1140 
atcatgaaga agagtccttt cagcggaccc 1200 
gctgtgcctt cccgcagctc agcttctgtc 1260 
acgtcatccc tgagcaccat caactatgat 1320 
cagatcagcc aggcctcggc cttggccccg 1380 
gcccctgccc ctgctccagc catggtatca 1440 
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gctctggccc aggccccagc ccctgtccca 
gccccacctg cccccaagcc cacccaggct 
cagctgcagt ttgatgatga agacctgggg 
gtgttcacag acctggcatc cgtcgacaac 
atacctgtgg ccccccacac aactgagccc 
cgcctagtga caggggccca gaggcccccc 
gggctcccca atggcctcct ttcaggagat 
ttctcagccc tgctgagtca gatcagctcc 



gtcctagccc caggccctcc tcaggctgtg 1500 
ggggaaggaa cgctgtcaga ggccctgctg 1560 
gccttgcttg gcaacagcac agacccagct 1620 
tccgagtttc agcagctgct gaaccagggc 1680 
atgctgatgg agtaccctga ggctataact 1740 
gacccagctc ctgctccact gggggccccg 1800 
gaagacttct cctccattgc ggacatggac 1860 
taa 1893 



<210> 14 
<211> 654 
<212> PRT . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence of GeneSwitch regulator protein v. 3.1 
comprising GAL4 DBD, p65 transactivation domain 
and mutated hPR LED 

<400> 14 

Met Asp Ser Gin Gin Pro Asp Leu Lys Leu Leu Ser, Ser lie Glu Gin 
15 10 15 

Ala Cys Asp He Cys Arg Leu Lys Lys Leu Lys Cys Ser Lys Glu Lys 
20 25 30 

Pro Lys Cys Ala Lys Cys Leu Lys Asn Asn Trp Glu Cys Arg Tyr Ser 
35 40 45 

Pro Lys Thr Lys Arg Ser Pro Leu Thr Arg Ala His Leu Thr Glu Val 
50 55 60 

Glu Ser Arg Leu Glu Arg Leu Glu Gin Leu Phe Leu Leu He Phe Pro 
65 70 75 80 

Arg Glu Asp Leu Asp Met He Leu Lys Met Asp Ser Leu Gin Asp He 
85 90 95 

Lys Ala Leu Leu Glu Phe Pro Gly Val Asp Gin Lys Lys Phe Asn Lys 
100 105 110 

Val Arg Val Val Arg Ala Leu Asp Ala Val Ala Leu Pro Gin Pro Val 
115 120 125 

Gly Val Pro Asn Glu Ser Gin Ala Leu Ser Gin Arg Phe Thr Phe Ser 
130 135 140 

Pro Gly Gin Asp He Gin Leu He Pro Pro Leu He Asn Leu Leu Met 
145 150 155 160 

Ser He Glu Pro Asp Val He Tyr Ala Gly His Asp Asn Thr Lys Pro 
165 170 175 
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Asp Thr Ser Ser Ser Leu Leu Thr Ser Leu Asn Gin Leu Gly Glu Arg 
180 185 190 

Gin Leu Leu Ser Val Val Lys Trp Ser Lys Ser Leu Pro Gly Phe Arg 
195 200 205 

Asn Leu His lie Asp Asp Gin lie Thr Leu lie Gin Tyr Ser Trp Met 
210 215 220 

Ser Leu Met Val Phe Gly Leu Gly Trp Arg Ser Tyr Lys His Val Ser 
225 230 235 240 

Gly Gin Met Leu Tyr Phe Ala Pro Asp Leu lie Leu Asn Glu Gin Arg 
245 250 255 

Met Lys Glu Ser Ser Phe Tyr Ser Leu Cys Leu Thr Met Trp Gin lie 
260 265 270 

Pro Gin Glu Phe Val Lys Leu Gin Val Ser Gin Glu Glu Phe Leu Cys 
275 280 285 

Met Lys Val Leu Leu Leu Leu Asn Thr lie Pro Leu Glu Gly Leu Arg 
290 295 300 

Ser Gin Thr Gin Phe Glu Glu Met Arg Ser Ser Tyr lie Arg Glu Leu 
305 310 315 320 

lie Lys Ala lie Gly Leu Arg Gin Lys Gly Val Val Ser Ser Ser Gin 
325 330 335 

Arg Phe Tyr Gin Leu Thr Lys Leu Leu Asp Asn Leu His Asp Leu Val 
340 345 350 

Lys Gin Leu His Leu Tyr Cys Leu Asn Thr Phe lie Gin Ser Arg Ala 
355 360 365 

Leu Ser Val Glu Phe Pro Glu Met Met Ser Glu Val lie Ala Gly Ser 
370 375 380 

Thr Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg His Arg 
385 390 395 400 

lie Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser lie Met 
405 410 415 

Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg 
420 425 430 

Arg lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala 
435 440 445 

Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn Tyr Asp 
450 455 460 



Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin Ala Ser 
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465 470 475 480 

Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro 
485 490 495 

Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro 
500 505 510 

Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala 
515 520 525 

Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu 
530 535 540 

Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser 
545 550 555 560 

Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu 
565 570 575 

Phe Gin Gin Leu Leu Asn Gin Gly lie Pro Val Ala Pro His Thr Thr 
580 585 590 

Glu Pro Met Leu Met Glu Tyr Pro Glu Ala lie Thr 2^g Leu Val Thr 
595 600 605 

Gly Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro 
610 615 620 

Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser lie 
625 630 635 640 

Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin lie Ser Ser 
645 650 



<210> 15 
<211> 630 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence for GeneSwitch regulator protein v. 4.0 
comprising truncated GAL4 DBD, p65 transactivation 
domain and mutated hPR LBD. 



<400> 15 

Met Asp Ser Gin Gin Pro Asp Leu Lys Leu Leu Ser Ser lie Glu Gin 
15 10 15 

Ala Cys Asp lie Cys Arg Leu Lys Lys Leu Lys Cys Ser Lys Glu Lys 
20 25 30 

Pro Lys Cys Ala Lys Cys Leu Lys Asn Asn Trp Glu Cys Arg Tyr Ser 
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35 



40 



45 



Pro Lys Thr Lys Arg Ser Pro Leu Thr Arg Ala His Leu Thr Glu Val 
50 55 60 

Glu Ser Arg Leu Glu Arg Leu Glu Gin Leu Phe Leu Leu He Phe Pro 
65 70 75 80 

Arg Asp Gin Lys Lys Phe Asn Lys Val Arg Val Val Arg Ala Leu Asp 
85 90 95 

Ala Val Ala Leu Pro Gin Pro Val Gly Val Pro Asn Glu Ser Gin Ala 
100 105 110 

Leu Ser Gin Arg Phe Thr Phe Ser Pro Gly Gin Asp He Gin Leu He 
115 120 125 

Pro Pro Leu He Asn Leu Leu Met Ser He Glu Pro Asp Val He Tyr 
130 ^ 135 140 

Ala Gly His Asp Asn Thr Lys Pro Asp Thr Ser Ser Ser Leu Leu Thr 
145 150 155 160 

Ser Leu Asn Gin Leu Gly Glu Arg Gin Leu Leu Ser Val Val Lys Trp 
165 170 175 

Ser Lys Ser Leu Pro Gly Phe Arg Asn Leu His He Asp Asp Gin He 
180 185 190 

Thr Leu He Gin Tyr Ser Trp Met Ser Leu Met Val Phe Gly Leu Gly 
195 200 205 

Trp Arg Ser Tyr Lys His Val Ser Gly Gin Met Leu Tyr Phe Ala Pro 
210 215 220 

Asp Leu He Leu Asn Glu Gin Arg Met Lys Glu Ser Ser Phe Tyr Ser 
225 230 235 240 

Leu Cys Leu Thr Met Trp Gin He Pro Gin Glu Phe Val Lys Leu Gin 
245 250 255 

Val Ser Gin Glu Glu Phe Leu Cys Met Lys Val Leu Leu Leu Leu Asn 
260 265 270 

Thr He Pro Leu Glu Gly Leu Arg Ser Gin Thr Gin Phe Glu Glu Met 
275 280 285 

Arg Ser Ser Tyr He Arg Glu Leu He Lys Ala He Gly Leu Arg Gin 
290 295 300 

Lys Gly Val Val Ser Ser Ser Gin Arg Phe Tyr Gin Leu Thr Lys Leu 
305 310 315 320 

Leu Asp Asn Leu His Asp Leu Val Lys Gin Leu His Leu Tyr Cys Leu 



325 



330 



335 
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Asn Thr Phe lie Gin Ser Arg Ala Leu Ser Val Glu Phe Pro Glu Met 
340 345 350 

Met Ser Glu Val lie Ala Gly Ser Thr Pro Met Glu Phe Gin Tyr Leu 
355 360 365 

Pro Asp Thr Asp Asp Arg His Arg lie Glu Glu Lys Arg Lys Arg Thr 
370 375 380 

Tyr Glu Thr Phe Lys Ser lie Met Lys Lys Ser Pro Phe Ser Gly Pro 
385 390 395 400 

Thr Asp Pro Arg Pro Pro Pro Arg Arg lie Ala Val Pro Ser Arg Ser 
405 410 415 

Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser 
420 425 430 

Ser Leu Ser Thr lie Asn Tyr Asp Glu Phe Pro Thr Met Val Phe Pro 
435 440 445 

Ser Gly Gin lie Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin 
450 455 460 

Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val Ser 
465 470 475 480 

Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly Pro 
485 490 495 

Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu 
500 505 510 

Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp 
515 520 525 

Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr Asp 
530 535 540 

Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly 
545 550 555 560 

lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr Pro 
565 570 575 

Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp Pro 
580 585 590 

Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu Ser 
595 600 605 



Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala Leu 
610 615 620 
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Leu Ser Gin lie Ser Ser 
625 630 



<210> 16 
<211> 439 
<212> DNA 

<213> Gallus gallus 
<400> 16 

gggccgctct agctagagtc tgcctgcccc 
gctccctcac aggtgaagct cgaaaactcc 
ggcctcctcc ctcacgcctc gctgcgcfccc 
caccgcttcg tccttaacgg gcccggcggt 
gggggcccgg cagacgctcc ttatacggcc 
gcgccttctt tgggcagcgc cgggccgggg 
acggccgggg ccgcattcct gggggccggg 
ccggggccgg cgggcgact 



ctgcctggca cagcccgtac ctggccgcac 60 
gtccccgtaa ggagccccgc tgccccccga 120 
cggctcccgc acggccctgg gagaggcccc 180 
gccgggggat tatttcggcc ccggccccgg 240 
cggcctcgct cacctgggcc gcggccagga 300 
ccgcgccggg cccgacaccc aaatatggcg 360 
cggtgctccc gcccgcctcg ataaaaggct 420 

439 



<210> 17 
<211> 176 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sectuence: Core promoter 
region coir5>rising a TATA box, (16) to (23), a 
putative initiation region (inr) , (35) to (64), from 
the cm promoter, and a 5'UTR, (59) to (175), from 
CMV. 

<400> 17 

gtcgactcta gagggtatat aatggatctc gagatatcgg agctcgttta gtgaaccgtc 60 
agatcgcctg gagacgccat ccacgctgtt ttgacctcca • tagaagacac cgggaccgat 120 
ccagcctccg cggccgggaa cggtgcattg gaacgcggat tccccgtgtt aattaa 176 



<210> 18 
<211> 184 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pEPl556 -intact 
inducible promoter region comprising a TATA box, 
initiation region (inr) and a UT12 Transcription 
factor binding site (TFBS) 



<400> 18 

ccgagtcgac tctagagggt atataatgga 
cgtcagatcg cctggagacg ccatccacgc 
cgatccagcc tccgcggccg ggaacggtgc 
cagg 



tctcgagata tcggagctcg tttagtgaac 60 
tgttttgacc tccatagaag acaccgggac 120 
attggaacgc ggattccccg tgttaattaa 180 

184 
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<210> 19 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pEP 1595 core 
promoter having a Sal 1/EcoRV deletion of the TATA 
box leaving only the inr and theUT-12 TFBS 
regions 

<400> 19 

ccgagtcgaa tcggagctcg tttagtgaac cgtcagatcg cctggagacg ccatccacgc 60 
tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg ggaacggtgc 120 
attggaacgc ggattccccg tgttaattaa cagg 154 



<210> 20 ' 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pEP1596 core 
promoter having a EcoR/ BsmBI . deletion of the inr 
region leaving only the TATA box and the UT12-TFBS 

<400> 20 

ccgagtcgac tctagagggt atataatgga tctcgagatg cctggagacg ccatccacgc 60 
tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg ggaacggtgc 120 
attggaacgc ggattccccg tgttaattaa cagg 154 



<210> 21 
<211> 140 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pEP1597 core 

promoter having a SacII/PacI deletion of UT12 TFBS 
leaving only the TATA box and the inr region 

<400> 21 

ccgagtcgac tctagagggt atataatgga tctcgagata tcggagctcg tttagtgaac 60 
cgtcagatcg cctggagacg ccatccacgc tgttttgacc tccatagaag acaccgggac 120 
cgatccagcc tcctaacagg 140 



<210> 22 
<211> 5566 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Complete 
nucleic acid sequence of pGS 1694 plasmid 

<400> 22 • 

ggggccgctc tagctagagt ctgcctgccc cctgcctggc acagcccgta cctggccgca 60 
cgctccctca caggtgaagc tcgaaaactc cgtccccgta aggagccccg ctgccccccg 120 
aggcctcctc cctcacgcct cgctgcgctc ccggctcccg cacggccctg ggagaggccc 180 
ccaccgcttc gtccttaacg ggcccggcgg tgccggggga ttatttcggc cccggccccg 240 
ggggggcccg gcagacgctc cttatacggc ccggcctcgc tcacctgggc cgcggccagg 300 
agcgccttct ttgggcagcg ccgggccggg gccgcgccgg gcccgacacc caaatatggc 360 
gacggccggg gccgcattcc tgggggccgg gcggtgctcc cgcccgcctc gataaaaggc 420 
tccggggccg gcgggcgact cagatcgcct ggagacgcca tccacgctgt tttgacctcc 480 
atagaagaca ccgggaccga tccagcctcc gcggccggga acggtgcatt ggaacgcgga 540 
ttccccgtgt taattaacag gtaagtgtct tcctcctgtt tccttcccct gctattctgc 600 
tcaaccttcc tatcagaaac tgcagtatct gtatttttgc tagcagtaat actaacggtt 660 
ctttttttct cttcacaggc caccaagcta ccggtccacc atggactccc agcagccaga 720 
tctgaagcta ctgtcttcta tcgaacaagc atgcgatatt tgccgactta aaaagctcaa 780 
gtgctccaaa gaaaaaccga agtgcgccaa gtgtctgaag aacaactggg agtgtcgcta 840 
ctctcccaaa accaaaaggt ctccgctgac tagggcacat ctgacagaag tggaatcaag 900 
gctagaaaga ctggaacagc tatttctact gatttttcct cgagaccaga aaaagttcaa 960 
taaagtcaga gttgtgagag cactggatgc tgttgctctc ccacagccag tgggcgttcc 1020 
aaatgaaagc caagccctaa gccagagatt cactttttca ccaggtcaag acatacagtt 1080 
gattccacca ctgatcaacc tgttaatgag cattgaacca gatgtgatct atgcaggaca 1140 
tgacaacaca aaacctgaca cctccagttc tttgctgaca agtcttaatc aactaggcga 1200 
gaggcaactt ctttcagtag tcaagtggtc taaatcattg ccaggttttc gaaacttaca 1260 
tattgatgac cagataactc tcattcagta ttcttggatg agcttaatgg tgtttggtct 1320 
aggatggaga tcctacaaac acgtcagtgg gcagatgctg tattttgcac ctgatctaat 1380 
actaaatgaa cagcggatga aagaatcatc attctattca ttatgcctta ccatgtggca 1440 
gatcccacag gagtttgtca agcttcaagt tagccaagaa gagttcctct gtatgaaagt 1500 
attgttactt cttaatacaa ttcctttgga agggctacga agtcaaaccc agtttgagga 1560 
gatgaggtca agctacatta gagagctcat caaggcaatt ggtttgaggc aaaaaggagt 1620 
tgtgtcgagc tcacagcgtt tctatcaact tacaaaactt cttgataact tgcatgatct 1680 
tgtcaaacaa cttcatctgt actgcttgaa tacatttatc cagtcccggg cactgagtgt 1740 
tgaatttcca gaaatgatgt ctgaagttat tgctgggtcg acgcccatgg aattccagta 1800 
cctgccagat acagacgatc gtcaccggat tgaggagaaa cgtaaaagga catatgagac 1860 
cttcaagagc atcatgaaga agagtccttt cagcggaccc accgaccccc ggcctccacc 1920 
tcgacgcatt gctgtgcctt cccgcagctc agcttctgtc cccaagccag caccccagcc 1980 
ctatcccttt acgtcatccc tgagcaccat caactatgat gagtttccca ccatggtgtt 2040 
tccttctggg cagatcagcc aggcctcggc cttggccccg gcccctcccc aagtcctgcc 2100 
ccaggctcca gcccctgccc ctgctccagc catggtatca gctctggccc aggccccagc 2160 
ccctgtccca gtcctagccc caggccctcc tcaggctgtg gccccacctg cccccaagcc 2220 
cacccaggct ggggaaggaa cgctgtcaga ggccctgctg cagctgcagt ttgatgatga 2280 
agacctgggg gccttgcttg gcaacagcac agacccagct gtgttcacag acctggcatc 2340 
cgtcgacaac tccgagtttc agcagctgct gaaccagggc atacctgtgg ccccccacac 2400 
aactgagccc atgctgatgg agtaccctga ggctataact cgcctagtga caggggccca 2460 
gaggcccccc gacccagctc ctgctccact gggggccccg gggctcccca atggcctcct 2520 
ttcaggagat gaagacttct cctccattgc ggacatggac ttctcagccc tgctgagtca 2580 
gatcagctcc taaggatcct ccggactaga aaagccgaat tctgcaggaa ttgggtggca 2640 
tccctgtgac ccctccccag tgcctctcct ggccctggaa gttgccactc cagtgcccac 2700 
cagccttgtc ctaataaaat taagttgcat cattttgtct gactaggtgt ccttctataa 2760 
tattatgggg tggagggggg tggtatggag caaggggcaa gttgggaaga caacctgtag 2820 
ggctcgaggg ggggcccggt acgatctgcc ggtctcccta tagtgagtcg tattaatttc 2880 
gataagccag gttaacctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 2940 
tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg 3000 
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gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa 3060 
cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 3120 
gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 3180 
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 3240 
ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 3300 
cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 3360 
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 3420 
cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 3480 
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 3540 
gaagtggtgg cctaactacg gctacactag aagaacagta . tttggtatct gcgctctgct 3600 
gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 3660 
tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 3720 
agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 3780 
agggattttg gtcatgagcg cgcctaggct tttgcaaaga tcgatcaaga gacaggatga 3840 
ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 3900 
gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3960 
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4020 
ctgaatgaac tgcaagacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4080 
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4140 
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4200 
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4260 
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4320 
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4380 
agcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4440 
atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4500 
cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4560 
gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4620 
tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4680 
cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 4740 
gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 4800 
tggagttctt cgcccaccct aggcgcgctc atgagcggat acatatttga atgtatttag 4860 
aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc taaattgtaa 4920 
gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 4980 
aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga 5040 
gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 5100 
ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt 5160 
ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 5220 
gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 5280 
cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 5340 
cgcttaatgc gccgctacag ggcgcgtccc attcgccatt caggctgcgc aactgttggg 5400 
aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 5460 
caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg 5520 
ccagtgagcg cgcgtaatac gactcactat agggcgaatt gggtac 5566 



<210> 23 
<211> 3100 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Complete 
nucleic acid sequence of pEP1666 plasmid. 

<400> 23 
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gggtcgaagc ggagtactgt cctccgagtg gagtactgtc ctccgagcgg agtactgtcc 60 
tccgagtcga gggtcgaagc ggagtactgt cctccgagtg gagtactgtc ctccgagcgg 120 
agtactgtcc tccgagtcga ctctagaggg tatataatgg atctcgagat gcctggagac 180 
gccatccacg ctgttttgac ctccatagaa gacaccggga ccgatccagc ctccgcggcc 240 
gggaacggtg cattggaacg cggattcccc gtgttaatta acaggtaagt gtcttcctcc 300 
tgtttccttc ccctgctatt ctgctcaacc ttcctatcag aaactgcagt atctgtattt 360 
ttgctagcag taatactaac ggttcttttt ttctcttcac aggccaccaa gcttccatgg 420 
gggtgcacga atgtcctgcc tggctgtggc tgctcctgtc cctgctgtcc ctccctctgg 480 
gcctcccagt cctgggcgcc ccaccacgcc tcatctgtga cagccgcgtc ctggagaggt 540 
atctcctgga ggccaaggag gccgagaata tcacgacggg ctgtgctgaa cactgcagcc 600 
tgaatgagaa tatcactgtc ccagacacca aagtgaattt ctatgcctgg aagaggatgg 660 
aggtcgggca gcaggccgtg gaagtctggc agggcctggc cctgctgtcc gaagctgtcc 720 
tgcggggcca ggccctgctg gtcaactctt cccagccgtg ggagcc'cctg cagctgcatg 780 
tggataaagc cgtcagtggc ctgcgcagcc tcaccactct gctgcgggct ctgggagccc 840 
agaaggaagc catctcccct ccagatgcgg cctccgctgc tccactccgc acaatcactg 900 
ctgacacttt ccgcaaactc ttccgagtct actccaattt cctccgggga aagctgaagc 960 
tgtacacagg ggaggcctgc aggacagggg acagatgagt ctagaaaagc cgaattctgc 1020 
aggaattggg tggcatccct gtgacccctc cccagtgcct ctcctggccc tggaagttgc 1080 
cactccagtg cccaccagcc ttgtcctaat aaaattaagt tgcatcattt tgtctgacta 1140 
ggtgtccttc tataatatta tggggtggag gggggtggta tggagcaagg ggcaagttgg 1200 
gaagacaacc tgtagggctc gagggggggc ccggtaccag cttttgttcc ctttagtgag 1260 
ggttaatttc gagcttggtc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 1320 
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 1380 
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 1440 
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 1500 
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 1560 
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 1620 
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 1680 
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 1740 
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 1800 
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 1860 
gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 1920 
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 1980 
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 2040 
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagaa gaactcgtca 2100 
agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 2160 
aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 2220 
tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca 2280 
ttttccacca tgatattcgg caagcaggca tcgccatgcg tcacgacgag atcctcgccg 2340 
tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 2400 
tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 2460 
cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 2520 
attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 2580 
tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc 2640 
acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 2700 
agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 2760 
gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 2820 
aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 2880 
cgaaacgatc ctcatcctgt ctcttgatca gatcttgatc ccctgcgcca tcagatcctt 2940 
ggcggcaaga aagccatcca gtttactttg cagggcttcc caaccttacc agagggcgcc 3000 
ccagctggca attccggttc gcttgctgtc cataaaaccg cccagtctag caactgttgg 3060 
gaagggcggg gctgcaggaa ttcgagcttg catgcctgca 3100 



<210> 24 



15 



I 



wo 02/24899 PCT/USOl/30305 

<211> 630 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: General amino 
acid sequence of a truncated GAL4 DBD, hPR LBD 
separated by X and X can be shorter/ longer than 
depicted 

<220> 

<221> DOMAIN 
<222> (9) . . (81) 

<400> 24 

Met Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Leu Leu Ser Ser lie Glu Gin 
15 10 15 

Ala Cys Asp lie Cys Arg Leu Lys Lys Leu Lys Cys Ser Lys Glu Lys 
20 25 30 

Pro Lys Cys Ala Lys Cys Leu Lys Asn Asn Trp Glu Cys Arg Tyr Ser 
35 40 45 

Pro Lys Thr Lys Arg Ser Pro Leu Thr Arg Ala His Leu Thr Glu Val 
50 55 60 

Glu Ser Arg Leu Glu Arg Leu Glu Gin Leu Phe Leu Leu lie Phe Pro 
65 70 75 80 

Arg Xaa Xaa Lys Lys Phe Asn Lys Val Arg Val Val Arg Ala Leu Asp 
85 90 95 

Ala Val Ala Leu Pro Gin Pro Val Gly Val Pro Asn Glu Ser Gin Ala 
100 105 110 

Leu Ser Gin Arg Phe Thr Phe Ser Pro Gly Gin Asp lie Gin Leu lie 
115 120 125 

Pro Pro Leu lie Asn Leu Leu Met Ser lie Glu Pro Asp Val lie Tyr 
130 135 140 

Ala Gly His Asp Asn Thr Lys Pro Asp Thr Ser Ser Ser Leu Leu Thr 
145 150 155 160 

Ser Leu Asn Gin Leu Gly Glu Arg Gin Leu Leu Ser Val Val Lys Trp 
165 170 175 

Ser Lys Ser Leu Pro Gly Phe Arg Asn Leu His lie Asp Asp Gin lie 
180 185 190 

Thr Leu lie Gin Tyr Ser Trp Met Ser Leu Met Val Phe Gly Leu Gly 
195 200 205 

Trp Arg Ser Tyr Lys His Val Ser Gly Gin Met Leu Tyr Phe Ala Pro 
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210 215 220 

Asp Leu lie Leu Asn Glu Gin Arg Met Lys Glu Ser Ser Phe Tyr Ser 
225 230 235 240 

Leu Cys Leu Thr Met Trp Gin lie Pro Gin Glu Phe Val Lys Leu Gin 
245 250 255 

Val Ser Gin Glu Glu Phe Leu Cys Met Lys Val Leu Leu Leu Leu Asn 
260 265 270 

Thr lie Pro Leu Glu Gly Leu Arg Ser Gin Thr Gin Phe Glu Glu Met 
275 280 285 

Arg Ser Ser Tyr He Arg Glu Leu He Lys Ala He Gly Leu Arg Gin 
290 295 300 

Lys Gly Val Val Ser Ser Ser Gin Arg Phe Tyr Gin Leu Thr Lys Leu 
305 310 315 320 

Leu Asp Asn Leu His Asp Leu Val Lys Gin Leu His Leu Tyr Cys Leu 
325 330 335 

Asn Thr Phe He Gin Ser Arg Ala Leu Ser Val Glu Phe Pro Glu Met 
340 345 350 

Met Ser Glu Val He Ala Xaa Xaa Xaa Pro Met Glu Phe Gin Tyr Leu 

355 360 365 

Pro Asp Thr Asp Asp Arg His Arg He Glu Glu Lys Arg Lys Arg Thr 
370 375 380 

Tyr Glu Thr Phe Lys Ser He Met Lys Lys Ser Pro Phe Ser Gly Pro 
385 390 395 . 400 

Thr Asp Pro Arg Pro Pro Pro Arg Arg He Ala Val Pro Ser Arg Ser 
405 410 415 

Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser 
420 425 430 

Ser Leu Ser Thr He Asn Tyr Asp Glu Phe Pro Thr Met Val Phe Pro 
435 440 445 

Ser Gly Gin He Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin 
450 455 460 

Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val Ser 
465 470 475 480 

Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly Pro 
485 490 495 

Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu 
500 505 510 
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Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp 
515 520 525 

Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr Asp 
530 535 540 

Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly 
545 550 555 560 

lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr Pro 
565 ' 570 575 

Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp Pro 
580 585 590 

Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu Ser 
595 600 605 

Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala Leu 
610 615 62-0 

Leu Ser Gin lie Ser Ser 
625 630 



<210> 25 
<211> 189 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: six repeating 
GAL-4 DNA binding site and a TATA box 

<400> 25 

aagcggagta ctgtcctccg agtggagtac tgtcctccga gcggagtact gtcctccgag 60 
tcgagggtcg aagcggagta ctgtcctccg agtggagtac tgtcctccga gcggagtact 120 
gtcctccgag tcgactctag agggtatata atggatctcg agatatcgga gctcgtttag 180 
tgaaccgtc 189 
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